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Functional and Hyperfunctional siRNA 

5 Cross Reference to Related Applications 

This application claims the benefit of the filing date of U.S. Provisional 
Application Serial No. 60/426,137, filed November 14, 2002, entitled "Combinatorial 
Pooling Approach for siRNA Induced Gene Silencing and Methods for Selecting 
siRNA " and U.S. Provisional Application Serial No. 60/502,050, filed September 10, 
10 2003, entitled "Methods for Selecting siRNA" the entire disclosures of which are 
hereby incorporated by reference into the present disclosure. 

Field of Invention 

The present invention relates to RNA interference ("RNAi"). 

15 

Background of the Invention 

Relatively recently, researchers observed that double stranded RNA 
("dsRNA") could be used to inhibit protein expression. This ability to silence a gene 
has broad potential for treating human diseases, and many researchers and 
20 commercial entities are currently investing considerable resources in developing 
therapies based on this technology. 

Double stranded RNA induced gene silencing can occur on at least three 
different levels: (i) transcription inactivation, which refers to RNA guided DNA or 
25 histone methylation; (ii) siRNA induced mRNA degradation; and (iii) mRNA induced 
transcriptional attenuation. 

It is generally considered that the major mechanism of RNA induced silencing 
(RNA interference, or RNAi) in mammalian cells is mRNA degradation. Initial 
30 attempts to use RNAi in mammalian cells focused on the use of long strands of 

dsRNA. However, these attempts to induce RNAi met with limited success, due in 
part to the induction of the interferon response, which results in a general, as opposed 
to a target-specific, inhibition of protein synthesis. Thus, long dsRNA is not a viable 
option for RNAi in mammalian systems. 
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More recently it has been shown that when short (18-30 bp) RNA duplexes are 
introduced into mammalian cells in culture, sequence-specific inhibition of target 
mRNA can be realized without inducing an interferon response. Certain of these 

5 short dsRNAs, referred to as small inhibitory RNAs ("siRNAs"), can act catalytically 
at sub-molar concentrations to cleave greater than 95% of the target mRNA in the 
cell. A description of the mechanisms for siRNA activity, as well as some of its 
applications are described in Provost et al , Ribonuclease Activity and RNA Binding of 
Recombinant Human Dicer, E.M.B.O. J., 2002 Nov. 1; 21(21): 5864-5874; Tabara et 

10 al, The dsRNA Binding Protein RDE-4 Interacts with RDE-1, DCR-l and a DexH- 
boxHelicase to Direct RNAi in C elegans, Cell 2002, June 28;109(7):861-71; Ketting 
et al, Dicer Functions in RNA Interference and in Synthesis of Small RNA Involved in 
Developmental Timing in C. elegans; Martinez et al, Single-Stranded Antisense 

siRNAs Guide Target RNA Cleavage in RNAi f Cell 2002, Sept. 6; 110(5):563; 

i 

15 Hutvagner & Zamore, A microRNA in a multiple-turnover RNAi enzyme complex, 
Science 2002, 297:2056. 

From a mechanistic perspective, introduction of long double stranded RNA 
into plants and invertebrate cells is broken down into siRNA by a Type III 

20 endonuclease known as Dicer. Sharp, RNA interference — 2001, Genes Dev. 2001, 
15:485. Dicer, a ribonuclease-III-like enzyme, processes the dsRNA into 19-23 base 
pair short interfering RNAs with characteristic two base V overhangs. Bernstein, 
Caudy, Hammond, & Hannon, Role for a bidentate ribonuclease in the initiation step 
of RNA interference, Nature 2001, 409:363. The siRNAs are then incorporated into 

25 an RNA-induced silencing complex (RISC) where one or more helicases unwind the 
siRNA duplex, enabling the complementary antisense strand to guide target 
recognition. Nykanen, Haley, & Zamore, ATP requirements and small interfering 
RNA structure in the RNA interference pathway, Cell 2001, 107:309. Upon binding to 
the appropriate target mRNA, one or more endonucleases within the RISC cleaves the 

30 target to induce silencing. Elbashir, Lendeckel, & Tuschl, RNA interference is 
mediated by 21- and 22 -nucleotide RNAs, Genes Dev2001, 15:188, Figure 1. 

The interference effect can be long lasting and may be detectable after many 
cell divisions. Moreover, RNAi exhibits sequence specificity. Kisielow, M. et al 
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(2002) Isoform-specific knockdown and expression of adaptor protein ShcA using 
small interfering RNA, J. of Biochemistry 363: 1-5. Thus, the RNAi machinery can 
specifically knock down one type of transcript, while not affecting closely related 
mRNA. These properties make siRNA a potentially valuable tool for inhibiting gene 
5 expression and studying gene function and drug target validation. Moreover, siRNAs 
are potentially useful as therapeutic agents against: (1) diseases that are caused by 
over-expression or misexpression of genes; and (2) diseases brought about by 
expression of genes that contain mutations. 

10 Successful siRNA-dependent gene silencing depends on a number of factors. 

One of the most contentious issues in RNAi is the question of the necessity of siRNA 
design, i.e., considering the sequence of the siRNA used. Early work in C elegans 
and plants circumvented the issue of design by introducing long dsRNA (see, for 
instance, Fire, A. et al (1998) Nature 391 :806-81 1). In this primitive organism, long 

1 5 dsRNA molecules are cleaved into siRNA by Dicer, thus generating a diverse 

population of duplexes that can potentially cover the entire transcript. While some 
fraction of these molecules are non-functional (i.e. induce little or no silencing) one or 
more have the potential to be highly functional, thereby silencing the gene of interest 
and alleviating the need for siRNA design. Unfortunately, due to the interferon 

20 response, this.saaiie approach is unavailable for mammalian systems. While this effect 
can be circumvented by bypassing the Dicer cleavage step and directly introducing 
siRNA, this tactic carries with it the risk that the chosen siRNA sequence may be non- 
functional or semi-functional. ■ - r ' 

25 A number of researches have expressed the view that siRNA design is not a 

crucial element of RN Ai. On the other hand, others in the field have begun to explore 
the possibility that RNAi can be made more efficient by paying attention to the design, 
of the siRNA. Unfortunately, none of the reported methods have provided a 
satisfactory scheme for reliably selecting siRNA with acceptable levels of 

30 functionality. Accordingly, there is a need to develop rational criteria by which to 
select siRNA with an acceptable level of functionality, and to identify siRNA that 
have this improved level of functionality, as well as to identify siRNAs that are 
hyperfunctional. 
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Summary of the Invention 

The present invention is directed to increasing the efficiency of RNAi, 
particularly in mammalian systems. Accordingly, the present invention provides kits, 
siRNAs and methods for increasing siRNA efficacy. 

5 - 

According to one embodiment, the present invention provides a kit for gene 
silencing, wherein said kit is comprised of a pool of at least two siRNA duplexes, 
each of which is comprised of a sequence that is complementary to a portion of the 
sequence of one or more target messenger KNA. 

10 

According to a second embodiment, the present invention provides a method 
for optimizing RNA interference by using one or more siRNAs that are optimized 
according to a formula (or algorithm) selected from: 
Formula I 

1 5 Relative functionality of siRNA= -(GC/3) -KAU15-19) -(Tm 20 *c)*3 -(G n )*3 -(C19) 
+(A 19 )*2 +(A 3 ) +(U 10 )+(A 14 ) -(U 5 ) -(An) 

Formula II 

Relative functionality of siRNA= -(GC/3) -(AU 15 -i9)*3 -{Gi 3 )*3 -(C19) +(A i9 )*2 
20 +(A 3 ) 

Formula III 

Relative functionality of siRNA= -(GC/3) +(AU 15 -i 9 ) ^Tm 20 <>c)*3 - 

25 Formula IV 

Relative functionality of siRNA= 

-GC/2+( AUi 5 -i9)/2-(Tm 2 ooc)*2 -(G J3 )*3 -(C 19 ) +(A 19 )*2 +(A 3 ) +(Ui 0 )+(A 14 ) -(U 5 ) - 
(An) 

30 Formula V 

Relative functionality of siRNA-(G I3 )*3 ~(Ci 9 ) +(A 19 )*2 +(A 3 ) + (Ui 0 )+(A 14 ) -(U 5 ) 
-(A u ) 



Formula VI 
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Relative functionality of siRNA=-(Gi 3 )*3 -(C 19 ) +(A 19 )*2 +(A 3 ) 
Formula VII 

Relative functionality of siRNA=-(GC/2) +(AU 15 -i9)/2 -( Tm 2 o°c)*l -(Gn)*3 -(C19) 
5 +(A W )*3 +(A 3 )*3 +(U 10 )/2+(A !4 )/2 -(U 5 )/2 -(Aj i)/2 



wherein in Formulas I - VII: 

Tm 20°c=l if the Tm is greater than 20°C; 
10 A19 = 1 if A is the base at position 19 on the sense strand, otherwise its value 

is 0; 

AU15.19 = 0-5 depending on the number of A^Or U bases on the sense strand 

at 

positions 15-19; 

15 G13 = 1 if G is the base at position 1 3 on the sense strand, otherwise its value is 

0; 

C19 = 1 if C is the base at position 19 of the sense strand, otherwise its value is 

°; 

GC= the number of G and C bases in the entire sense strand; 
20 A 3 = 1 if A is the'base at position 3 on the sense strand, otherwise its value is 0; 

An= 1 if A is the base at position 11 on the sense strand, otherwise its value is 

0; 

Ah= 1 if A is the base at position 14 on the sense strand; otherwise its value is 

0; 

25 Uio= 1 if U is the base at position 10 on the sense strand, otherwise its value is 

0; 

Us= 1 if U is the base at position 5 on the sense strand, otherwise its value is 0; 

or 



3 0 Formula VIII Relative functionality of siRNA = 

(-14)*Gi3-13*Ai-12*U7-ll*U2-10*Aii-10*U 4 -10*C3-10*C5-10*C6- 

9*A,o-9*U 9 -9*C 1 8-8*G,o-7*Ui-7*Ui6-7*C I 7-7*Ci9 
+7*Ui7+8*A2+8*A4+8*A5+8*C 4 +9*G8+10*A7+10*U 1 8+ll :,: A 1 9+ 
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11*09+15*0!+ 18*A 3 +19*U 10 -Tm-3* (GCtotai)-6*(GC 15 -i9> 
30*X; and 

Formula DC Relative functionality of siRNA = 
5 (14.1)*A 3 +(14.9)*A6+(17.6)*Ai3+(24.7)*Ai9+(14.2)*U 10 +(10.5)* 

C 9 +(23.9)*Gi+(16.3)*G 2 +(-123)*An+(-19.3)*Ui+(-12.1)*U2+ 
(-1 l)*U 3 +(-15.2) *Ui5+(-l 1 .3)*Ui 6 + (-1 1 .8) *C 3 +(-17.4)*C 6 +(- 
10.5)*Ct+ (-13.7)*Gi3+(-25.9)*G 19 -Tm-3* (GC to taO - 6*(GCi 5 -i 9 )- 
30*X 

1 0 wherein 

Ai = 1 if A is the base at position 1 of the sense strand, otherwise its value is 0; 

A 2 = 1 if A is the base at position 2 of the sense strand, b'therwise its value is 0; 

A 3 = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0; 

A4 = 1 if A is the base at position 4 of the sense strand, otherwise its value is 0; 
15 A 5 = 1 if A is the base at position 5 of the sense strand, otherwise its value is 0; 

Ae = 1 if A is the base at position 6 of the sense strand, otherwise its value 'is 0; 

A 7 = 1 if A is the base at position 7 of the sense strand, otherwise its value is 0; 

A10 = 1 if A is the base at position 10 of the sense strand, otherwise its value is 0; 

An = 1 if A is the base at position 1 1 of the sense strand, otherwise its value is 0; 
20 A13 = 1 if A is the base atrpbsition 13 of the sense strand, otherwise its value is 0; 

Ai9= 1 if A is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 18 base pairs in length, its value is 0; 

C 3 = 1 if C is the base at position 3 of the sense strand, otherwise its value is 0; 
25 C4 = 1 if C is the base at position 4 of the sense strand, otherwise its value is 0; 

C 5 = 1 if C is the base at position 5 of the sense strand, otherwise its value is 0; 

C 6 = 1 if C is the base at position 6 of the sense strand, otherwise its value is 0; 

C 7 = 1 if C is the base at position 7 of the sense strand, otherwise its value is 0; 

C 9 = 1 if C is the base at position 9 of the sense strand, otherwise its value is 0; 
30 Cn = 1 if C is the base at position 17 of the sense strand, otherwise its value is 0; 

Cis = 1 if C is the base at position 1 8 of the sense strand, otherwise its value is 0; 

C19 = 1 if C is the base at position 1 9 of the sense strand, otherwise if another base is 
present or the sense strand is only 18 base pairs in length, its value is 0; 
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Gi = 1 if G is the base at position 1 on the sense strand, otherwise its value is 0; 
G2 = 1 if G is the base at position 2 of the sense strand, otherwise its value is 0; 
Gg = 1 if G is the base at position 8 on the sense strand, otherwise its value is 0; 
Gio= 1 if G is the base at position 10 on the sense strand, otherwise its value is 0; 
5 G13 = I if G is the base at position 13 on the sense strand, otherwise its value is 0; 

G19 = 1 if G is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 1 8 base pairs in length, its value is 0; 

Ui = 1 if U is the base at position 1 on the sense strand, otherwise its value is 0; 
10 U 2 = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0; 

U3 = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0; 

U4 = 1 if U is the base at position 4 on the sense strand, otherwise its value is 0; 

U 7 = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0; 

U 9 = 1 if U is the base at position 9 on the sense strand, otherwise its value is 0; 
15 U10 = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0; 

U15 = 1 if U is the base at position 15 on the sense strand, otherwise its value is 0; 

Ui6 = 1 if U is the base at position 16 on the sense strand, otherwise its value is 0; 

Un = 1 if U is the base at position 17 on the sense strand, otherwise its value is 0; 

Ui8 = 1 if U is the base at position 1 8 on the sense strand, otherwise its value is 0; 

20 -rr' 

GC15-19 = the number of G and C bases within positions 15 - 19 of the sense strand 

or within positions 15 -18 if the sense strand is only 18 base pairs in length; 
GCtotai = the number of G and C bases in the sense strand; 
Tm= 100 if the targeting site contains an inverted repeat longer than 4 base pairs, 
25 otherwise its value is 0; and 

X = the number of times that the same nucleotide repeats four or more times in a row. 

According to a third embodiment, the present invention is directed to a kit 
comprised of at least one siRNA that contains a sequence that is optimized according 
30 to one of the formulas above. Preferably the kit contains at least two optimized 
siRNA, each of which comprises a duplex, wherein one strand of each duplex 
comprises at least eighteen contiguous bases that are complementary to a region of a 
target messenger RNA. For mammalian systems, the siRNA preferably comprises 
between 18 and 30 nucleotide base pairs. 
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The ability to use the above algorithms, which are not sequence or species 
specific, allows for the cost-effective selection of optimized siRNAs for specific 
target sequences. Accordingly, there will be both greater efficiency and reliability in 
5 the use of siRNA technologies. 



According to a fourth embodiment, the present invention provides a method 
for developing an siRNA algorithm for selecting functional and hyperfunctional 
siRNAs for a given sequence. The method comprises: 
10 ( a ) selecting a set of siRNAs; 

(b) measuring the gene silencing ability of each siRNA from said set; 

(c) determining the relative functionality of each siRNA; 

(d) determining the amount of improved functionality by the presence or 
absence of at least one variable selected from the group consisting of 

1 5 the total GC content, melting temperature of the siRNA, GC content at 

positions 15—19, the presence or absence of a particular nucleotide at a 
particular position and the number of times that the same nucleotide 
repeats within a given sequence; and 

(e) developing an algorithm using the information of step (d). 

20 

According to this embodiment, preferably the set of siRNAs comprises at least 
90 siRNAs from at least one gene, more preferably at least 180 siRNAs from at least 
two different genes, and most preferably at least 270 and 360 siRNAs from at least 
three and four different genes, respectively. Additionally, in step (d) the 
25 determination is made with preferably at least two, more preferably at least three, 
even more preferably at least four, and most preferably all of the variables. The 
resulting algorithm is not target sequence specific. 



In a fifth embodiment, the present invention provides rationally designed 
30 siRNAs identified using the formulas above. 



In a sixth embodiment, the present invention is directed to hyperfunctional 

siRNA. 
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For a better understanding of the present invention together with other and 
farther advantages and embodiments, reference is made to the following description 
taken in conjunction with the examples, the scope of which is set forth in the 
appended claims. 

5 

Brief Description of the Figures 

Figure 1 shows a model for siRNA-RISC interactions. RISC has the ability to interact 
with either end of the siRNA or miRNA molecule. Following binding, the duplex is 
unwound, and the relevant target is identified, cleaved, and released. 

10 

Figure 2 is a representation of the functionality of two hundred and seventy siRNA 
duplexes that were generated to target human cyclophilin, human diaz^pam-binding 
inhibitor (DB), and firefly luciferase. 

1 5 Figure 3 a is a representation of the silencing effect of 30 siRNAs in three different 
cells lines, HEK293, DU145, and Hela. Figure 3b shows the frequency of different 
functional groups (>95% silencing (black), >80% silencing (gray), >50% silencing 
(dark gray), and <50% silencing (white)) based on GC content. In cases where a given 
bar is absent from a particular GC percentage, no siRNA were identified for that 

20 particular group. Figure 3c shows theirequency of different functional groups based 
on melting temperature (Tm). Again, each group has four different divisions: >95% 
(black), >80% (gray), >50% (dark gray), and <50% (white) silencing. 

Figure 4 is a representation of a statistical analysis that revealed correlations between 
25 silencing and five sequence-related properties of siRNA: (A) an A at position 19 of 
the sense strand, (B) an A at position 3 of the sense strand, (C) a U at position 10 of 
the sense strand, (D) a base other than G at position 13 of the sense strand, and (E) a 
base other than C at position 19 of the sense strand. All variables were correlated with 
siRNA silencing of firefly luciferase and human cyclophilin. SiRNAs satisfying the 
30 criterion are grouped on the left (Selected) while those that do not, are grouped on the 
right (Eliminated). Y-axis is "% Silencing of Control." Each position on the X-axis 
represents a unique siRNA. 
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Figures 5 A and 5 B are representations of firefly hiciferase and cyclophilin siRNA 
panels sorted according to functionality and predicted values using Formula VIIL The 
siRNA found within the circle represent those that have Formula VIII values 
(SMARTscores™) above zero. SiRNA outside the indicated area have calculated 
5 Formula VIII values that are below zero. Y-axis is "Expression (% Control)." Each 
position on the X-axis represents a unique siRNA. 

- Figure 6A is a representation of the average internal stability profile (AISP) derived 
from 270 siRNAs taken from three separate genes (cyclophilin B, DBI and firefly 
10 hiciferase). Graphs represent AISP values of highly functional, functional, and non- 
functional siRNA. Figure 66 is a comparison between the AISP of naturally derived 
GFP siRNA (filled squares) and the AISP of siRNA from cyclophilin B,£)BI, and 
luciferase having >90% silencing properties (no fill) for the antisense strand. "DG" is 
the symbol for AG, free energy. 

15 

Figure 7 is a histogram showing the differences in duplex functionality upon 
introduction of baisepair mismatches. The X-axis shows the mismatch introduced inot 
the siRNA and the position it is introduced (e.g., 8C->A reveals that position 8 (which 
normally has a C) has been changed to an A). The Y-axis is "% Silencing 
20 (Normalized to Control)." ~' r 

Figure 8a is histogram that shows the effects of 5 5 sense and antisense strand 
- - ~ modification with 2'-0-methylation on functionality. Figure 8b is an expression - 

profile showing a comparison of sense strand off-target effects for IGF1R-3 and 2'-0- 
25 methyl IGF1R-3. Sense strand off-targets (lower white box) are not induced when the 
5' end of the sense strand is modified with 2'-0-methyl groups (top white box). 

Figure 9 shows a graph of SMARTscores™ versus RNAi silencing values for more 
than 360 siRNA directed against 30 different genes. SiRNA to the right of the vertical 
30 bar represent those siRNA that have desirable SMARTscores™. 

Figures 10A - E compare the RNAi of five different genes (SEAP, DBI, PLK, 
Firefly Luciferase, and Renila Luciferase) by varying numbers of randomly selected 
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siRNA and four rationally designed (SMART-selected) siRNA chosen using the 
algorithm described in Formula VIIL In addition, RNAi induced by a pool of the four 
SMART-selected siRNA is reported at two different concentrations (100 and 400nM). 
10F is a comparison between a pool of randomly selected EGFR siRNA (Pool 1) and 
5 a pool of SMART selected EGFR siRNA (Pool 2). Pool 1 , SI— S4 and Pool 2 SI— 
S4 represent the individual members that made up each respective pool. Note that 
numbers for random siRNAs represent the position of the 5' end of the sense strand of 
the duplex. The Y-axis represents the % expression of the control(s). The X-axis is 
the percent expression of the control. 

10 

Figure 11 shows the Western blot results from cells treated with siRNA directed 
against twelve different genes involved in the clathrin-dependent endocytosis " 
pathway (CHC 5 Dynll, CALM, CLCa, CLCb, EpslS, EpslSR, Rab5a, Rab5b, Rab5c, 

P2 subunit of AP-2 and EEA.l). SiRNA were selected using Formula VIII. "Pool" 

/ 

15 represents a mixture of duplexes 1-4. Total concentration of each siRNA in the pool . 
is 25 nM. Total concentration = 4 x 25 = 100 nM. 

M 

\ 

Figure 12 is a representation of the gene silencing capabilities of rationally-selected 
siRNA directed against ten different genes (human and mouse cyclophilin, C-myc, 
20 human lamin A/C, QB (ubiquinol-cytochromeTc reductase core protein I), MEK1 and 
MEK2, ATE1 (arginyl-tRNA protein transferase), GAPDH, and Eg5). The Y-axis is 
the percent expression of the control. Numbers 1,2,3 and 4 represent individual 
rationally selected siRNA. "Pool" represents a mixture of the four individual siRNA. 

25 Figure 13 is the sequence of the top ten Bcl2 siRNAs as determined by Formula VIII. 
Sequences are listed 5 5 to 3'. 

Figure 14 is the knockdown by the top ten Bcl2 siRNAs at lOOnM concentrations. 
The Y-axis represents the amount of expression relative to the non-specific (ns) and 
30 transfection mixture control. 



Figure 15 represents a functional walk where siRNA beginning on every other base 
pair of a region of the luciferase gene are tested for the ability to silence the luciferase 
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gene. The Y-axis represents the percent expression relative to a control. The X-axis 
represents the position of each individual siRNA. 



10 



15 



20 



25 



30 



Figure 16 is a histogram demonstrating the inhibition of target gene expression by 
pools of 2 and 3 siRNAs duplexes taken from the walk described in Figure 15. The 
Y-axis represents the percent expression relative to control. The X-axis represents the 
position of the first siRNA in paired pools, or trios of siRNA. For instance, the first 
paired pool contains siRNA 1 and 3. The second paired pool contains siRNA 3 and 5. 
Pool 3 (of paired pools) contains siRNA 5 and 7, and so on. 



Figure 17 is a histogram demonstrating the inhibition of target gene expression by 
pools of 4 and 5 siRNA duplexes. The Y-axis represents the percent expression - - " 
relative to a control. The X-axis represents the positoin of the first siRNA in each 
pool. 



Figure 18 is a histogram demonstrating the inhibition of target gene expression by 
siRNAs that are ten and twenty basepairs apart. The Y-axis represents the percent 
expression relative to a control. The X-axis represents the position of the first siRNA 
in each pool. 



Figure 19 shows that pools of siRNAs (dark gray bar) work as well (or better) than 
the best siRNA in the pool (light gray bar). The Y-axis represents the percent 
expression relative to a control. The-X axis represents the position of the first siRNA 
in each pool. 



Figure 20 shows that the combination of several semifunctional siRNAs (dark gray) 
result in a significant improvement of gene expression inhibition over individual 
(semi-functional; light gray) siRNA. The Y-axis represents the percent expression 
relative to a control. 



Figure 21 shows both pools (Library, Lib) and individual siRNAs in inhibition of 
gene expression of Beta-Galactosidase, Renilla Luciferase and SEAP (alkaline 
phosphatase). Numbers on the X-axis indicate the position of the 5'-most nucleotide 
of the sense strand of the duplex. The Y-axis represents the percent expression of 
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each gene relative to a control. Libraries contain siRNAs that begin at the following 
nucleotides: Seap: Lib 1: 206, 766, 812,923, Lib 2: 1117, 1280, 1300, 1487, Lib 3: 
206, 766, 812, 923, 1117, 1280, 1300,1487, Lib 4: 206, 812, 1117, 1300, Lib 5: 766, 
923, 1280, 1487, Lib 6: 206, 1487; Bgal: Lib 1: 979, 1339, 2029, 2590, Lib 2: 
5 1087,1783,2399,3257, Lib 3: 979, 1783, 2590, 3257, Lib 4: 979, 1087, 1339, 1783, 
2029, 2399,2590,3257, Lib 5: 979, 1087, 1339, 1783, Lib 6: 2029,2399,2590,3257; 
Renilla: Lib 1 : 174,300,432,568, Lib 2: 592, 633, 729,867, Lib 3: 174,300,432, 
568, 592, 633,729,867, Lib 4: 174,432,592,729, Lib 5: 300,568,633,867, Lib 6: 
592,568. 

10 

Figure 22 showS the results of an EGFR and TfiiR internalization assay when single 
gene knockdowns are performed. The Y-axis represents percent internalization 
relative to control. 

15 

Figure 23 shows the results of an EGFR and TfiiR internalization assay when 
multiple genes axe knocked down (e.g. Rab5a, b, c). The Y-axis represents the 
percent internalization relative to control. 

20 Figure 24 shows the simultaneous knockdown of four different genes. SiRNAs 

directed against G6PD, GAPDH, PLK, and UBQ were simultaneously introduced into 
cells. Twenty-four hours later, cultures were harvested and assayed for mRNA target 
> levels-, fon^each of the four genes. A comparison is made between cells transfected 
with individual siRNAs vs. a pool of siRNAs directed against all four genes. 

25 

Figure 25 shows the functionality of ten siRNAs at 0.3nM concentrations. 

Detailed Description 
Definitions 

30 Unless stated otherwise, the following terms and phrases have the meanings 

provided below: 
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siRNA 

The term "siRNA" refers to small inhibitory RNA duplexes that induce the 
RNA interference (RNAi) pathway. These molecules can vary in length (generally 
between 18-30 basepairS) and contain varying degrees of complementarity to their 
target mRNA in the antisense strand. Some, but not all, siRNA have unpaired 
overhanging bases on the 5' or 3' end of the sense strand and/or the antisense strand. 
The term "siRNA" includes duplexes of two separate strands, as well as single strands 
that can form hairpin structures comprising a duplex region. 



10 SiRNA may be divided into five (5) groups (non-functional, semi-functional, 

functional, highly functional, and hyper-functional) based on the level or degree of 
silencing that they induce in cultured cell lines. As used herein, these definitions are 
based on a set of conditions where the siRNA is transfected into said cell line at a 
concentration of lOOnM and the level of silencing is tested at a time of roughly 24 

1 5 hours after transfection, and not exceeding 72 hours after transfection. In this context, 
"non-functional siRNA" are defined as those siRNA that induce less than 50% 
(<50%) target silencing. "Semi-functional siRNA" induce 50-79% target silencing. 
"Functional siRNA" are molecules that induce 80-95% gene silencing. "Highly- 
functional siRNA" are molecules that induce greater than 95% gene silencing. 

20 "Hyperfunctional siRNA" are a special class of molecules: For purposes of this 
document, hyperfunctional siRNA are defined as those molecules that: (1) induce 
greater than 95% silencing of a specific target when they are transfected at 
subnanomol-ar concentrations {i.e., less than one nanomolar); and/or (2) induce 
functional (or better) levels of silencing for greater than 96 hours. These relative 

25 functionalities (though not intended to be absolutes) may be used to compare siRNAs 
to a particular target for applications such as functional genomics, target identification 
and therapeutics. 



miRNA 

30 The term "miRNA" refers to microRNA. 



Gene silencing 

The phrase "gene silencing" refers to a process by which the expression of a 
specific gene product is lessened or attenuated. Gene silencing can take place by a 
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variety of pathways. Unless specified otherwise, as used herin, gene silencing refers 
to decreases in gene product expression that results from RNA interference (RNAi), a 
defined, though partially characterized pathway whereby small inhibitory RNA 
(siRNA) act in concert with host proteins {e.g. the RNA induced silencing complex, 
5 RISC) to degrade messenger RNA (mRNA) in a sequence-dependent fashion. The 
. level of gene silencing can be measured by a variety of means, including, but not 
limited to, measurement of transcript levels by Northern Blot Analysis, B-DNA 
techniques, transcription-sensitive reporter constructs, expression profiling {e.g. DNA 
chips), and related technologies. Alternatively, the level of silencing can be measured 
10 by assessing the level of the protein encoded by a specific gene. This can be 
accomplished by performing a number of studies including Western Analysis, 
measuring the levels of expression of a reporter protein that has e.g. fluorescent 
properties {e.g. GFP) or enzymatic activity {e.g. alkaline phosphatases), or several 
other procedures. 

15 ' 

Transfection 

The term "transfection" refers to a process by which agents are introduced into 
a cell. The list of agents that can be transfected is large and includes, but is not 
limited to, siRNA, sense and/or anti-sense sequences, DNA encoding one or more 
20 genes and organized into an expression plasmid, proteins, proieiii fragments, and 

more. There are multiple methods for transfecting agents into a cell including, but not 
limited to, electroporation, calcium phosphate-based transfections, DEAE-dextran- 
based transfections, lipid-based transfections, molecular conjugate-based transfections 
{e.g. polylysine-DNA conjugates), microinjection and others. 

25 

Target 

The term "target" is used in a variety of different forms throughout this 
document and is defined by the context in which it is used. "Target mRNA" refers to 
a messenger RNA to which a given siRNA can be directed against. "Target sequence" 
30 and "target site" refer to a sequence within the mRNA to which the sense strand of an 
siRNA shows varying degrees of homology and the antisense strand exhibits varying 
degrees of complementarity. The term "siRNA target" can refer to the gene, mRNA, 
or protein against which an siRNA is directed. Similarly "target silencing" can refer 
to the state of a gene, or the corresponding mRNA or protein. 
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Off-target silencing and Off-target interference 

The phrases "off-target silencing" and "off-target interference" are defined as 
degradation of mRNA other than the intended target mRNA due to overlapping and/or 
partial homology with secondary mRNA messages. 



SMARTscore™ 

The term "SMARTscore™" refers to a number determined by applying any of 
the Formulas I - Formula IX to a given siRNA sequence. The term "SMART- 
10 selected" or "rationally selected" or "rational selection" refers to siRNA that have 
been selected on the basis of their SMARTscores™. 



Complementary 

The term "complementary" refers to the ability of polynucleotides to form 
1 5 base pairs with one another. Base pairs are typically formed by hydrogen bonds 
between nucleotide units in antiparallel polynucleotide strands. Complementary 
polynucleotide strands can base pair in the Watson-Crick manner (e.g., A to T, A to 
U, C to G), or in any other manner that allows for the formation of duplexes. As 
persons skilled in the art are aware, when using RNA as opposed to DNA, uracil 
20 rather than thymine is the base that is considered to be complementary to adenosine. 
However, when a U is denoted in the context of the present invention, the ability to 
substitute a T is implied, unless otherwise stated. 



Perfect complementarity or 100% complementarity refers to the situation in 
25 which each nucleotide unit of one polynucleotide strand can hydrogen bond with a 
nucleotide unit of a second polynucleotide strand. Less than perfect complementarity 
refers to the situation in which some, but not all, nucleotide units of two strands can 
hydrogen bond with each other. For example, for two 20-mers, if only two base pairs 
on each strand can hydrogen bond with each other, the polynucleotide strands exhibit 
30 10% complementarity. In the same example, if 1 8 base pairs on each strand can 
hydrogen bond with each other, the polynucleotide strands exhibit 90% 
complementarity. "Substantial complementarity" refers to polynucleotide strands 
exhibiting 79% or greater complementarity, excluding regions of the polynucleotide 
strands, such as overhangs, that are selected so as to be noncomplementary. 
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("Substantial similarity 5 ' refers to polynucleotide strands exhibiting 79% or greater 
similarity, excluding regions of the polynucleotide strands, such as overhangs, that are 
selected so as not to be similar.) Thus, for example, two polynucleotides of 29 
nucleotide units each, wherein each comprises a di-dT at the 3 5 terminus such that the 
5 duplex region spans 27 bases, and wherein 26 of the 27 bases of the duplex region on 
each strand are complementary, are substantially complementary since they are 96.3% 
complementary when excluding the di-dT overhangs. 



Deoxvnucleotide 

1 0 The term "deoxynucleotide" refers to a nucleotide or polynucleotide lacking a 

hydroxyl group (OH group) at the V and/or 3' position of a sugar moiety. Instead, it 
has a hydrogen bonded to the T and/or 3' carbon. Within an RNA molecule that 
comprises one or more deoxynucleotides, "deoxynucleotide" refers to the lack of an 
OH group at the 2' position of the sugar moiety, having instead a hydrogen bonded 

1 5 directly to the 2 ' carbon. 

Deoxvribonucleotide 

The terms "deoxyribonucleotide" and "DNA" refer to a nucleotide or 
polynucleotide comprising at least one sugar moiety that has an H, rather than an OH, 
20 at its 2 5 and/or 3'position. - ^ 

Substantially Similar 

The phrase^substantially similar" refers to a similarity of at least 90% with 
respect to the identity of the bases of the sequence. 

25 

Duplex Re gion 

The phrase "duplex region" refers to the region in two complementary or 
substantially complementary polynucleotides that form base pairs with one another, 
either by Watson-Crick base pairing or any other manner that allows for a stabilized 
30 duplex between polynucleotide strands that are complementary or substantially 

complementary. For example, a polynucleotide strand having 21 nucleotide units can 
base pair with another polynucleotide of 21 nucleotide units, yet only 19 bases on 
each strand are complementary or substantially complementary, such that the "duplex 
region" has 1 9 base pairs. The remaining bases may, for example, exist as 5' and 3' 
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overhangs. Further, within the duplex region, 100% complementarity is not required; 
substantial complementarity is allowable within a duplex region. Substantial 
complementarity refers to 79% or greater complementarity. For example, a mismatch 
in a duplex region consisting of 19 base pairs results in 94.7% complementarity, 
5 rendering the duplex region substantially complementary. 

Nucleotide 

The term "nucleotide" refers to a ribonucleotide or a deoxyribonucleotide or 
modified form thereof, as well as an analog thereof. Nucleotides includ_e species that 
10 comprise purines, e.g., adenine, hypoxanthine, guanine, and their derivatives and 
analogs, as well as pyrimidines, e.g., cytosine, uracil, thymine, and their derivatives 
and analogs; 

Nucleotide analogs include nucleotides having modifications in the chemical 
1 5 structure of the base, sugar and/or phosphate, including, but not limited to, 5-position 
pyrimidine modifications, 8-position purine modifications, modifications at cytosine 
exocyclic amines, and substitution of 5-bromo-uracil; and 2'-position sugar 
modifications, including but not limited to, sugar-modified ribonucleotides in which 
the 2'-OH is replaced by a group such as an H, OR, R, halo, SH, SR, NH 2 , NHR, 
20 NR2, or CN, wherein R is an alkyl moiety. Nucleotide analogs are also meant to 
include nucleotides with bases such as inosine, queuosine, xanthine, sugars such as 
2 , -methyl ribose, non-natural phosphodiester linkages such as methylphosphonates, 
phosphoro thioates and-peptides . 

25 Modified bases refer to nucleotide bases such as, for example, adenine, 

guanine, cytosine, thymine, uracil, xanthine, inosine, and queuosine that have been 
modified by the replacement or addition of one or more atoms or groups. Some 
examples of types of modifications that can comprise nucleotides that are modified 
with respect to the base moieties include but are not limited to, alkylated, halogenated, 

30 thiolated, aminated, amidated, or acetylated bases, individually or in combination. 
More specific examples include, for example, 5-propynyluridine, 5-propynylcytidine, 
6-methyladenine, 6-methylguanine, N,N,-dimethyladenine, 2-propyladenine, 2- 
propylguanine, 2-aminoadenine, 1-methylinosine, 3-methyluridine, 5-methylcytidine, 
5-methyluridine and other nucleotides having a modification at the 5 position, 5-(2- 
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amino)propyl uridine, 5-halocytidine, 5~halouridine, 4-acetylcytidine, 1- 
methyladenosine, 2-methyladenosine 5 3-methylcytidine, 6-methyluridine 5 2- 
methylguanosine, 7-methylguanosine, 2,2-dimethylguanosine, 5- 
methylaminoethyluridine, 5-methyloxyuridine, deazanucleotides such as 7-deaza- 

5 adenosine, 6-azouridine, 6-azocytidine, 6-azothymidine, 5-methyl-2-thiouridine, other 
thio bases such as 2-thiouridine and 4-thiouridine and 2-thiocytidine, dihydrouridine, 
pseudouridine, queuosine, archaeosine, naphthyl and substituted naphthyl groups, any 
O- and N-alkylated purines and pyrimidines such as N6-methyladenosine, 5- 
methylcarbonylmethyluridine, uiidine 5-oxyacetic acid, pyridine-4-one, pyridine-2- 

10 one, phenyl and modified phenyl groups such as aminophenol or 2,4,6-trimethoxy 
benzene, modified cytosines that act as G-clamp nucleotides, 8-substituted adenines 
and guanines, 5-substituted uracils and thymines, azapyrimidines, 
carboxyhydroxyalkyl nucleotides, carboxyalkylaminoalkyl nucleotides, and 
alkylcarbonylalkylated nucleotides. Modified nucleotides also include those 

1 5 nucleotides that are modified with respect to the sugar moiety, as well as nucleotides 
having sugars or analogs thereof that are not ribosyl. For example, the sugar moieties 
maybe, or be based on, mannoses, arabinoses, glucopyranoses, galactopyranoses, 4'- 
thioribose, and other sugars, heterocycles, or carbocycles. 

20 The term nucleotide is also meant to include what are known in the art; as 

universal bases. By way of example, universal bases include but are not limited to 3- 
nitropyrrole, 5-nitroindole, or nebularine. The term "nucleotide" is also meant to 
include the N3' to PSNphQspher-amidate, resulting from the substitution of a ribosyl 3' 
oxygen with an amine group. 

25 

Further, the term nucleotide also includes those species that have a detectable 
label, such as for example a radioactive or fluorescent moiety, or mass label attached 
to the nucleotide. 

30 Polynucleotide 

The term "polynucleotide" refers to polymers of nucleotides, and includes but 
is not limited to DNA, RNA, DNA/RNA hybrids including polynucleotide chains of 
regularly and/or irregularly alternating deoxyribosyl moieties and ribosyl moieties 
(ue. , wherein alternate nucleotide units have an -OH, then and -H, then an -OH, then 
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an -H, and so on at the 2' position of a sugar moiety), and modifications of these 
kinds of polynucleotides, wherein the attachment of various entities or moieties to the 
nucleotide units at any position are included. 



Polyribonucleotide 

The term "polyribonucleotide" refers to a polynucleotide comprising two or 
more modified or unmodified ribonucleotides and/or their analogs. The term 
"polyribonucleotide" is used interchangeably with the term "oligoribonucleotide." 



10 Ribonucleotide and ribonucleic acid 

The term "ribonucleotide" and the phrase "ribonucleic acid" (RNA), refer to a 
modified or unmodified nucleotide or polynucleotide comprising at least one 
ribonucleotide unit. A ribonucleotide unit comprises an hydroxyl group attached to 
the 2 5 position of a ribosyl moiety that has a nitrogenous base attached in N- 

1 5 glycosidic linkage at the 1' position of a ribosyl moiety, and a moiety that either 
allows for linkage to another nucleotide or precludes linkage. 



Detailed Description of the Invention 

20 The present invention is directed to improving the efficiency of gene silencing 

by siRNA. Through the inclusion of multiple siRNA sequences that are targeted to a 
particular gene and/or selecting an siRNA sequence based on certain defined criteria, 
improved efficiency may be achieved. 



25 The present invention will now be described in connection with preferred 

embodiments. These embodiments are presented in order to aid in an understanding 
of the present invention and are not intended, and should not be construed, to limit the 
invention in any way. All alternatives, modifications and equivalents that may 
become apparent to those of ordinary skill upon reading this disclosure are included 

30 within the spirit and scope of the present invention. 



Furthermore, this disclosure is not a primer on RNA interference. Basic 
concepts known to persons skilled in the art have not been set forth in detail. 
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Optimizing siRNA 

According to one embodiment, the present invention provides a method for 
improving the effectiveness of gene silencing for use to silence a particular gene 
through the selection of an optimal siRNA. An siRNA selected according to this 
method may be used individually, or in conjunction with the first embodiment, i.e., 
with one or more other siRNAs, each of which may or may not be selected by this 
criteria in order to maximize their efficiency. 



The degree to which it is possible to select an siRNA for a given mRNA that 
1 0 maximizes these criteria will depend on the sequence of the mRNA itself However, 
the selection criteria will be independent of the target sequence. According to this 
method, an siRNA is selected for a given gene by using a rational design. That said, 
rational design can be described in a variety of ways. Rational design is, in simplest 
terms, the application of a proven set of criteria that enhance the probability of 
1 5 identifying a functional or hyperfunctional siRNA. In one method, rationally 
designed siRNA can be identified by maximizing one or more of the following 
criteria: 



I . A low GC content, preferably between about 30 -52%. 

20 2. At least 2, preferably at least 3 A or U bases at positions 15- 19 of the 

siRNA on the sense strand. 

3. An A base at position 1 9 of the sense strand. 

4. An A base at position^of the sense strand. 

5. A U base at position 10 of the sense strand. 
25 6. An A base at position 14 of the sense strand. 

7. A base other than C at position 19 of the sense strand. 

8. A base other than G at position 1 3 of the sense strand. 

9. A Tm, which refers to the character of the internal repeat that results in 
inter- or intramolecular structures for one strand of the duplex, that is 

30 preferably not stable at greater than 50°C, more preferably not stable at 

greater than 37°C, even more preferably not stable at greater than 30°C 
and most preferably not stable at greater than 20°C. 

1 0. A base other than U at position 5 of the sense strand. 

II. A base other than A at position 11 of the sense strand. 



WO 2004/045543 



22 



PCT/US2003/036787 



Criteria 5, 6, 10 and 1 1 are minor criteria, but are nonetheless desirable. 
Accordingly, preferably an siRNA will satisfy as many of the aforementioned criteria 
as possible, more preferably at least 1-4 and 7-9, and most preferably all of the 
5 criteria 

With respect to the criteria, GC content, as well as a high number of AU in 
positions 15-19, may be important for easement of the unwinding of double stranded 
siRNA duplex. Duplex unwinding has been shown to be crucial for siRNA 
1 0 functionality in vivo . 

With respect to criterion 9, the internal structure is measured in terms of the 
melting temperature of the single strand of siRNA, which is the temperature at which 
50% of the molecules will become denatured. With respect to criteria 2-8 and 10- 
15 1 1, the positions refer to sequence positions on the sense strand, which is the strand 
that is identical to the mRNA. 

In one preferred embodiment, at least criteria 1 and 8 are satisfied. In another 
preferred embodiment, at least criteria 7 and 8 are satisfied. In still another preferred 
20, " embodiment, at least criteria 1, 8 and 9 are satisfied. 



It should be noted that all of the aforementioned criteria regarding sequence 
position specifics are with respect toAesS&end of the sense strand. Reference is 
made to the sense strand, because most databases contain information that describes 
25 the information of the mRNA. Because according to the present invention a chain can 
be from 1 8 to 30 bases in length, and the aforementioned criteria assumes a chain 19 
base pairs in length, it is important to keep the aforementioned criteria applicable to 
the correct bases. 

30 When there are only 1 8 bases, the base pair that is not present is the base pair 

that is located at the 3' of the sense strand. When there are twenty to thirty bases 
present, then additional bases are added at the 5 5 end of the sense chain and occupy 
positions "1 to "1 1 . Accordingly, with respect to SEQ. ED NO. 0001 . 
NNANANNNNUCNAANNNNA and SEQ. ID NO. 0028. 
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GUCNNANANNNNUCNAANNNNA, both would have A at position 3, A at 
position 5, U at position 10, C at position 1 1, A and position 13, A and position 14 
and A at position 19. However, SEQ. ID NO. 0028 would also have C at position -1, 
U at position —2 and G at position -3. 

5 

For a 19 base pair siRNA, an optimal sequence of one of the strands maybe 
represented below, where N is any base, A, C, G, or U: 

SEQ. ID NO. 0001. NN AN ANNNNU CN AANNNN A 
10 SEQ. ID NO. 0002. NNAN ANNNNU GN AANNNN A 

SEQ. ID NO. 0003. NNANANNNNUUNAANNNNA 

SEQ. ID NO. 0004. NNANANlNfNNUCNCANNNNA 

SEQ. ID NO. 0005. NNANANNNNUGNCANNNNA 

SEQ. ID NO. 0006. NNANAhJNNNUUNCANNNNA 
15 SEQ. ID NO. 0007. NNANANNNNUCMJANNNNA 

SEQ. ID NO. 0008.. NNANANNNNUGNUANNNNA 

SEQ. ID NO. 0009. NNANANNKNRJUNUANNNNA 

SEQ. ID NO. 0010. NNANCNNNNUCNAANNNNA 

SEQ. ID NO. 001 1 . NNANCNNNNUGNAANNNNA 
20 -SEQ . ID NO. 00 1 2 . NNANCNNNMJUNAAJSTNNNA 

SEQ. ID NO. 001 3 . NNANCNNNNUCNCANNNNA 

SEQ. ID NO. 0014. NNANCNNNNUGNCANNNNA 

SEQ. ID NO. 0015. NNANCNNNNU^JSAMMNNA 

SEQ. ID NO. 0016. NANCNNNNUCNUANNNNA 
25 SEQ . ID NO. 0017. NNANCNNNNUGNUANNNNA 

SEQ. ID NO. 0018. NNANCNNNNUUNUANNNNA 

SEQ. ID NO. 0019. NNANGNNNNUCNAANNNNA 

SEQ. ID NO. 0020. NTSIANGNNNNUGNAANNNNA 

SEQ. ID NO. 0021 . NNANGNNNNUUNAANNNNA 
30 SEQ. ID NO. 0022. NNANGNNNNUCNCANNNNA 

SEQ. ID NO. 0023 . NNAN GNNNNUGNCANNNN A 

SEQ. ID NO. 0024. NNANGNNNNUUNCANNNNA 

SEQ. ID NO. 0025. NNAN GNNNNU CNU ANNNN A 

SEQ. ID NO. 0026. NN ANGNNNNU GN UANNNNA 
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SEQ. ID NO. 0027. NNANGNNNNNUNUANNNNA 

In one embodiment, the sequence used as an siRNA is selected by choosing 
the siRNA that score highest according to one of the following seven algorithms that 
5 are represented by Formulas I - VII: 

Formula I 

Relative functionality of siRNA= -(GC/3) +(AU 15 . 19 ) -(Tm 20 o C )*3 -(G 13 )*3 -<C 19 ) 
+(A 19 )*2 +(A 3 ) +(U 10 )+(A, 4 ) -(U 5 ) ~(A„) 

10 

Formula II 

Relative functionality of siRNA= -(GC/3) -(AU 15 . 19 )* 3 -(G 13 )*3 -(C 19 ) +(A 19 )*2 
+(A 3 ) 

1 5 Formula III 

Relative functionality of siRNA= -(GC/3) +(AU 15 . 19 ) -(Tm 20 » c )*3 

Formula IV 

Relative functionality of siRNA= 

20 -GC/Z+( AU 15 . I9 )/2-(Tm 20 *c)*2 -(G 13 )*3 _ (Cl9 ) + ( Al9 )*2 +(A 3 ) +(U 10 )+(A 14 ) -(U 5 ) - 
(A,,) 



Formula V 



Relative functionality of siRNA=-(G 13 )*3 -(C 19 ) +(A 19 )*2 +(A 3 ) + (U 10 )+(A 14 ) -(Us) 
25 -(A„) 



Formula VI 

Relative functionality of siRNA=-(G, 3 )*3 -(C 19 ) +(A 19 )*2 +(A 3 ) 
30 Formula VII 

Relative functionality of siRNA=-(GC/2) +(AU 15 . 19 )/2 -( Tm 20 . c )*l - (Gl3 )*3 -(C 19 ) 
+(Ai 9 )*3 +(A 3 )*3 +(U 10 )/2+(A I4 )/2 -(U 5 )/2 -(A„)/2 



In Formulas I - VII: 
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wherein A i9 =1 if A is the base at position 19 on the sense strand, otherwise its value 
is 0 5 

AU 15-19 = 0-5 depending on the number of A or U bases on the sense strand 

at 

5 positions 15 -19; 

G13 = 1 if G is the base at position 13 on the sense strand, otherwise its value is 

0; 

C19 = 1 if C is the base at position 19 of the sense strand, otherwise its value is 

0; 

10 GC= the.number of G and C bases in the entire sense strand; 

Tm 20°c=l if the Tm is greater than 20°C; 

A3= 1 if A is the base at position' 3 on the sense strand, otherwise its value is 0; 
Uio= 1 if U is the base at position 10 on the sense strand, otherwise its value is 

0; 

15 A H = 1 if A is the base at position 14 on the sense strand, otherwise its value 

is 0; 

U5 = 1 if U is the base at position 5 on the sense strand, otherwise its value is 

0; and 

An = 1 if A is the base at position 1 1 of the sense strand, otherwise its value is 
20 0. ~ 



Formulas I -VII provide relative information regarding functionality. When 
the values for two sequences are compared for^gwen-formula, the relative 
functionality is ascertained; a higher positive number indicates a greater functionality. 
25 For example, in many applications a value of 5 or greater is beneficial. 

Additionally, in many applications, more than one of these formulas would 
provide useful information as to the relative functionality of potential siRNA 
sequences. However, it is beneficial to have more than one type of formula, because 
30 not every formula will be able to help to differentiate among potential siRNA 
sequences. For example, in particularly high GC mRNAs, formulas that take that 
parameter into account would not be useful and application of formulas that lack GC 
elements (eg., formulas V and VI) might provide greater insights into duplex 
functionality. Similarly, formula II might by used in situations where hairpin 
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structures are not observed in duplexes, and formula IV might be applicable for 
sequences that have higher AU content. Thus, one may consider a particular sequence 
in light of more than one or even all of these algorithms to obtain the best 
differentiation among sequences. In some instances, application of a given algorithim 
5 may identify an unususally large number of potential siRNA sequences, and in those 
cases, it may be appropriate to re-analyze that sequence with a second algorithm that 
is, for instance, more stringent. Alternatively, it is conceivable that analysis of a 
sequence with a given formula yields no acceptable siRNA sequences (i.e. low 
SMARTscores™). In this instance, it may be appropriate to re-analyze that sequences 

10 with a second algorithm that is, for instance, less stringent. In still other instances, 
analysis of a single sequence with two separate formulas may give rise to conflicting 
results (z.e. one formula generates a set of siRNA with high SMARTscores™ while 
the other formula identifies a set of siRNA with low SMARTscores™). In these 
instances, it may be necessary to determine which weighted factor(s) (e.g. GC 

1 5 content) are contributing to the discrepancy and assessing the sequence to decide 
whether these factors should or should not be included. Alternatively, the sequence 
could be analyzed by a third, fourth, or fifth algorithm to identify a set of rationally 
designed siRNA. 

20 The above-referenced criteria are particularly advantageous when used in 

combination with pooling techniques as depicted in Table I: 



Table I 



Criteria 


Functional Probability 




Oligos 




Pools 






>95% 


>80% 


<70% 


>95% 


>80% 


<70% 


Current 


33.0 


50.0 


23.0 


79.5 


97.3 


0.3 


New 


50.0 


88.5 


8.0 


93.8 


99.98 


0.005 


(GC) 


28.0 


58.9 


36.0 


72.8 


97.1 


1.6 



25 The term "current" refers to Tuschl's conventional siRNA parameters (Elbashir, S.M. 
et al. (2002) "Analysis of gene function in somatic mammalian cells using small 
interfering RNAs" Methods 26: 199-213). "New" refers to the design parameters 
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described in Formulas I- VII. "GC" refers to criteria that select siRNA solely on the 
basis of GC content. 



As Table I indicates, when more functional siRNA duplexes are chosen, 
5 siRNAs that produce <70% silencing drops from 23% to 8% and the number of 
siRNA duplexes that produce >80% silencing rises from 50% to 88.5%. Further, of 
the siRNA duplexes with >80% silencing, a larger portion of these siRNAs actually 
silence >95% of the target expression (the new criteria increases the portion from 
33% to 50%). Using this new criteria in pooled siRNAs, shows that, with pooling, the 
10 amount of silencing >95% increases from 79.5% to 93.8% and essentially eliminates 
any siRNA pool from silencing less than 70%. 

Table II similarly shows the particularly beneficial results of pooling in 
combination with the aforementioned criteria. However, Table II, which takes into 
1 5 account each of the aforementioned variables, demonstrates even a greater degree of 
improvement in functionality. 



Table II 





Functional Probability 


Oligos 


Pools 


Functional 


Average 


Non- 
functional 


Functional 


Average 


Non- 
functional 


Random 


20 


40 


50 


67 


97 


3 


Criteria 1 


52 


99 


0.1 


97- , -^ : — 


-93 


0.0040 


Criteria 4 


89 


99 


0.1 


99 


99 


0.0000 



The terms "functional," "Average," and "Non-functional" refer to siRNA that exhibit 
20 >80%, >50%, and <50% functionality, respectively. Criteria 1 and 4 refer to specific 
criteria described above. 



25 



The above-described algorithms may be used with or without a computer 
program that allows for the inputting of the sequence of the mRNA and automatically 
outputs the optimal siRNA. The computer program may, for example, be accessible 
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from a local terminal or personal computer, over an internal network or over the 
Internet. 



In addition to the formulas above, more detailed algorithms may be used for 
selecting siRNA. Preferably, at least one RNA duplex of between 18 and 30 base 
pairs is selected such that it is optimized according a formula selected from: 



Formula VIII: (-14)*Gi 3 -13*Ai-12*U7-ll*U2-10*Aii-10*U4-10*C3-10*C5-10*C 6 - 
10 9*A 10 -9*U9-9*Ci8-8*G,o-7*Ui-7*Ui6-7*Ci7-7*Ci9 

+7*Ui7+8*A 2 +8*A4+8*A 5 +8*C4+9*G 8 +10*A7+lO*Ui 8 +ll*Ai9+ 
ll*C 9 +15*Gi+ 18*A 3 +19*Uio-Tm^3* (GC to t a i)-6*(GC 15 -i9)- 
' 30*X;and 



15 Formula DC: (14.1)*A 3 +(14.9)*A6+(17.6)*A,3+(24.7)*A 19 +(14.2)*U 1 o+(10.5)* 

C 9 +(23.9)*Gi+(l 6.3) *G 2 +(-l 2.3) *Ai i+(-l 9.3) *Ui+(- 1 2. 1) *U 2 + 
(-1 l)*U 3 +(-l 5.2)*Uis+(-l 1 .3) *Ui 6 + (-1 1 .8) *C 3 +(-17.4)*C 6 +(-. 
10.5) *C 7 + (-13.7)*Gi 3 +(-25.9)*G 19 -Tm-3* (GQ otal ) - 6*(GCi 5 -i 9 )- 
30*X 

wherein 

Ai = 1 if A is the base at position 1 of the sense strand, otherwise its value is 0; 

A2 = 1 if A is the base at position 2 of the sense strand, otherwise its value is 0; 

A3 = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0; 
25 A4 = 1 if A is the base at position 4 of the sense strand, otherwise its value is 0; 

A5 = 1 if A is the base at position 5 of the sense strand, otherwise its value is 0; 

Ae = 1 if A is the base at position 6 of the sense strand, otherwise its value is 0; 

A7= 1 if A is the base at position 7 of the sense strand, otherwise its value is 0; 

Aio= 1 if A is the base at position 10 of the sense strand, otherwise its value is 0; 
30 An = 1 if A is the base at position 11 of the sense strand, otherwise its value is 0; 

An = 1 if A is the base at position 13 of the sense strand, otherwise its value is 0; 

Ai9= 1 if A is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 1 8 base pairs in length, its value is 0; 
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C3 = 1 if C is the base at position 3 of the sense strand, otherwise its value is 0; 
C4 = 1 if C is the base at position 4 of the sense strand, otherwise its value is 0; 
C5 = 1 if C is the base at position 5 of the sense strand, otherwise its value is 0; 
Ce = 1 if C is the base at position 6 of the sense strand, otherwise its value is 0; 
5 C7 = 1 if C is the base at position 7 of the sense strand, otherwise its value is 0; 
C9 = 1 if C is the base at position 9 of the sense strand, otherwise its value is 0; 
Cn = 1 if C is the base at position 17 of the sense strand, otherwise its value is 0; 
Cu = 1 if C is the base at position 18 of the sense strand, otherwise its value is 0; 
C19 = 1 if C is the base at position 19 of the sense strand, otherwise if another base is 
1 0 present or the sense strand is only 1 8 base pairs in length, its value is 0; 

Gi = 1 if G is the base at position 1 on the sense strand, otherwise its value is 0; 
G2 = 1 if G is the base at position 2 of the sense strand, otherwise its value is 0; 
Gs = 1 if G is the base at position 8 on the sense strand, otherwise its value is 0; 
15 G10 = 1 if G is the base at position 1 0 on the sense strand, otherwise its value is 0; 
Gn =1 if G is the base at position 13 on the sense strand, otherwise its value is 0; 
G19 = 1 if G is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 1 8 base pairs in length, its value is 0; 

20 Ui = 1 if U is thebase at position 1 on the sense strand, otherwise its value is 0; 
U2 = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0: 
U3 = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0: 
U4 = 1 if U is the base at position 4 on the sense strand, otherwise it-s.-y.alue is 0: 
U 7 = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0: 

25 U9 = 1 if U is the base at position 9 on the sense strand, otherwise its value is 0; 

U10 = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0; 
U15 = 1 if U is the base at position 15 on the sense strand, otherwise its value is 0; 
Uie = 1 if U is the base at position 16 on the sense strand, otherwise its value is 0; 
Un = 1 if U is the base at position 17 on the sense strand, otherwise its value is 0; 

30 Ui8 - 1 if U is the base at position 18 on the sense strand, otherwise its value is 0; 



GC15-19 = the number of G and C bases within positions 15 - 19 of the sense 

strand, or within positions 1 5 -18 if the sense strand is only 1 8 base pairs in 
length; 
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GCtotai = the number of G and C bases in the sense strand; 

Tm = 100 if the siRNA oligo has the internal repeat longer then 4 base pairs, 

otherwise its value is 0; and 
X = the number of times that the same nucleotide repeats four or more times in a 

5 row. 

The above formulas VIII and IX, as well as formulas I - VII, provide methods 
for selecting siRNA in order to increase the efficiency of gene silencing. A subset of 
variables of any of the formulas may be used, though when fewer variables are used, 
10 the optimization hierarchy becomes less reliable. 

With respect to the variables of the above-referenced formulas, a single letter 
of A or C or G or U followed by a subscript refers to a binary condition. The binary 
condition is that either the particular base is present at that particular position 

1 5 (wherein the value is "1 ") or the base is not present (wherein the value is "0"). 

Because position 19 is optional, i.e. there might be only 1 8 base pairs, when there are 
only 18 base pairs, any base with a subscript of 19 in the formulas above would have 
a zero value for that parameter. Before or after each variable is a number followed by 
*, which indicates that the value of the variable is to be multiplied or weighed by that 

20 number. 

The numbers preceding the variables A, or G, or C, or U in Formulas VIII and 
IX (or after the variables in Formula I - VII) were determined byt comparing the 
difference in the frequency of individual bases at different positions in functional 

25 siRNA and total siRNA. Specifically, the frequency in which a given base was 
observed at a particular position in functional groups was compared with the 
frequency that that same base was observed in the total, randomly selected siRNA set. 
If the absolute value of the difference between the functional and total values was 
found to be greater than 6%, that parameter was included in the equation. Thus for 

30 instance, if the frequency of finding a "G" at position 13 (Gi 3 ) is found to be 6% in a 
given functional group, and the frequency of G i3 in the total population of siRNAs is 
20%, the difference between the two values is 6%-20% = -14%. As the absolute value 
is greater than six (6), this factor (-14) is included in the equation. Thus in Formula 
VIII, in cases where the siRNA under study has a G in position 13, the accrued value 
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is (-14) * (1) = -14. In contrast, when a base other than G is found at position 13, the 
accrued value is (-14) * (0) = 0. 



When developing a means to optimize siRNAs, the inventors observed that a 
5 bias toward low internal thermodynamic stability of the duplex at the 5'-antisense 
(AS) end is characteristic of naturally occurring miRNA precursors. The inventors 
extended this observation to siRNAs for which functionality had been assessed in 
tissue culture. 

1 0 With respect to the parameter GC15-19, a value of 0 - 5 will be ascribed 

depending on the number of G or C bases at positions 15 to 19. If there are only 18 
base pairs, the value is between 0 and 4. - • 



With respect to the criterion GC to tai content, a number from 0-30 will be 
1 5 ascribed, which correlates to the total number of G and C nucleotides on the sense 
strand, excluding overhangs. Without wishing to be bound by any one theory, it is 
postulated that the significance of the GC content (as well as AU content at positions 
15-19, which is a parameter for formulas III - VII) relates to the easement of the 
unwinding of a double-stranded siRNA duplex. Duplex unwinding is believed to be 
20 crucial for siRNA functionality in vivo and overall low internal stability, especially 
low internal stability of the first unwound base pair is believed to be important to 
maintain sufficient processivity of RISC complex-induced duplex unwinding. If the 
duplex has 1 9 base pairs, those at positions 1 5- 1 9 on the sense strand-wilkunwind first 
if the molecule exhibits a sufficiently low internal stability at that position. As 
25 persons skilled in the art are aware, RISC is a complex of approximately twelve 
proteins; Dicer is one, but not the only, helicase within this complex. Accordingly, 
although the GC parameters are believed to relate to activity with Dicer, they are also 
important for activity with other RISC proteins. 

30 The value of the parameter Tm is 0 when there are no internal repeats longer 

than (or equal to) four base pairs present in the siRNA duplex; otherwise the value is 
1 . Thus for example, if the sequence ACGUACGU, or any other four nucleotide (or 
more) palindrome exists within the structure, the value will be one (1). Alternatively 
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if the structure ACGGACG, or any other 3 nucleotide (or less) palindrome exists, the 
value will be zero (0). 

The variable "X" refers to the number of times that the same nucleotide occurs 
5 contiguously in a stretch of four or more units. If there are, for example, four 
contiguous As in one part of the sequence and elsewhere in the sequence four 
contiguous Cs, X =2. Further, if there are two separate contiguous stretches of four of 
the same nucleotides or eight or more of the same nucleotides in a row, then X=2. 
However, X does not increase for five, six or seven contiguous nucleotides. 

10 

Again, when applying Formula VIII or Formula IX to a given mRNA, (the 
"target RNA" or "target molecule"), one may use a computer program to evaluate the 
criteria for every sequence of 1 8 - 30 base pairs or only sequences of a fixed length, 
e.g., 19 base pairs. Preferably the computer program is designed such that it provides 

1 5 a report ranking of all of the potential siRNAs between 1 8 and 30 base pairs, ranked 
according to which sequences generate the highest value. A higher value refers to a 
more efficient siRNA for a particular target gene. The computer program that may be 
used, may be developed in any computer language that is known to be useful for 
scoring nucleotide sequences, or it may be developed with the assistance of 

20 commercially available product such as Microsoft's product .net. Additionally, rather 
than run every sequence through one and/or another formula, one may compare a 
subset of the sequences, which may be desirable if for example only a subset are 
available. For instance, it may be desirable to first perform a BLAST (BastesJfcoeaL.. 
Alignment Search Tool) search and to identify sequences that have no homology to 

25 other targets. Alternatively, it may be desirable to scan the sequence and to identify 
regions of moderate GC context, then perform relevant calculations using one of the 
above-described formulas on these regions. These calculations can be done manually 
or with the aid of a computer. 

30 As with Formulas I - VII, either Formula VIII or Formula IX may be used for 

a given mRNA target sequence. However, it is possible that according to one or the 
other formula more than one siRNA will have the same value. Accordingly, it is 
beneficial to have a second formula by which to differentiate sequences. Formula IX 
was derived in a similar fashion as Formula VIII, yet used a larger data set and thus 
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yields sequences with higher statistical correlations to highly functional duplexes. The 
sequence that has the highest value ascribed to it may be referred to as a "first 
optimized duplex." The sequence that has the second highest value ascribed to it may 
be referred to as a "second optimized duplex." Similarly, the sequences that have the 
5 third and fourth highest values ascribed to them may be referred to as a third 

optimized duplex and a fourth optimized duplex, respectively. When more than one 
sequence has the same value, each of them may, for example, be referred to as first 
optimized duplex sequences or co-first optimized duplexes. 

10 SiRNA sequences identified using Formula VIII are contained within the 

enclosed compact disks. The data included on the enclosed compact disks is 
described more fully below. The sequences identified by Formula VIII that are 
disclosed in the compacts disks may be used in gene silencing applications. 

15 It should be noted that for Formulas VIE and EK all of the aforementioned 

criteria are identified as positions on the sense strand when oriented in the 5' to 3' 
direction as they are identified in connection with Formulas I — VII unless otherwise 
specified. 

20 Formulas I - IX, may be used^to select or to evaluate one, or more than one, 

siRNA in order to optimize silencing. Preferably, at least two optimized siRNAs that 
have been selected according to at least one of these formulas are used to silence a 
gene, more preferably at least three and most preferably at least four. The siRNAs-v^ ; 
may be used individually or together in a pool or kit. Further, they may be applied to 

25 a cell simultaneously or separately. Preferably, the at least two siRNAs are applied 
simultaneously. Pools are particularly beneficial for many research applications. 
However, for therapeutics, it may be more desirable to employ a single 
hyperfunctional siRNA as described elsewhere in this application. 

30 When planning to conduct gene silencing, and it is necessary to choose 

between two or more siRNAs, one should do so by comparing the relative values 
when the siRNA are subjected to one of the formulas above. In general a higher 
scored siRNA should be used. 
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Useful applications include, but are not limited to, target validation, gene 
functional analysis, research and drug discovery, gene therapy and therapeutics. 
Methods for using siRNA in these applications are well known to persons of skill in 
the art. 



Because the ability of siRNA to function is dependent on the sequence of the 
RNA and not the species into which it is introduced, the present invention is 
applicable across a broad range of species, including but not limited to all mammalian 
species, such as humans, dogs, horses, cats, cows, mice, hamsters, chimpanzees and 
10 gorillas, as well as other species and organisms such as bacteria, viruses, insects, 
plants and C. elegans. 

The present invention is also applicable for use for silencing a broad range of 
genes, including but not limited to the roughly 45,000 genes of a human genome, and 
15 has particular relevance in cases where those genes are associated with diseases such 
as diabetes, Alzheimer's, cancer, as well as all genes in the genomes of the 
aforementioned organisms. 

The siRNA selected according to the aforementioned criteria or one of the 
20 aforementioned algorithms are also, for example, useful in the simultaneous screening 
and functional analysis of multiple genes and gene families using high throughput 
strategies, as well as in direct gene suppression or silencing. 



Development of the Algorithms 

25 To identify siRNA sequence features that promote functionality and to 

quantify the importance of certain currently accepted conventional factors — such as 
G/C content and target site accessibility— the inventors synthesized an siRNA panel 
consisting of 270 siRNAs targeting three genes, Human Cyclophilin, Firefly 
Luciferase, and Human DBI. In all three cases, siRNAs were directed against specific 

30 regions of each gene. For Human Cyclophilin and Firefly Luciferase, ninety siRNAs 
were directed against a 199 bp segment of each respective mRNA. For DBI, 90 
siRNAs were directed against a smaller, 109 base pair region of the mRNA. The 
sequences to which the siRNAs were directed are provided below. 
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It should be noted that in certain sequences, "t" is present. This is because 
many databases contain information in this manner. However, the t denotes a uracil 
residue in mRNA and siRNA. Any algorithm will, unless otherwise specified, 
process a t in a sequence as a u. 

5 

Human cvclophilin: 193—390. M60857 
SEQ. ID NO. 29: 

gttccaaaaacagtggataattttgtggccttagctacaggagagaaaggatttggctacaaaaacagcaaattccatcgtgt 
aatcaaggacttcatgatccagggcggagacttcaccaggggagatggcacaggaggaaagagcatctacggtgagcg 
10 cttccccgatgagaacttcaaactgaagcactacgggcctggctggg 



Firefly luciferase: 1434— 163 L U47298 (r>GL3, Promeea^ 
SEQ, ID NO. 30: 

tgaacttcccgccgccgttgttgttttggagcacggaaagacgatgacggaaaaagagatcgtggattacgtcgccagtca 
15 agtaacaaccgcgaaaaagttgcgcggaggagttgtgtttgtggacgaagtaccgaaaggtcttaccggaaaactcgacg 
caagaaaaatcagagagatcctcataaaggccaagaagg 

DBL NM 020548 ( 202-310) fevery position) 
SEQ. ID NO. 0031: 

20 acgggcaaggccaagtgggatgcctggaatgagctgaaagggacttccaaggaagatgccatgaaagcttacatcaaca 
aagtagaagagctaaagaaaaaatacggg 

A list of the siRNAs appears in Table III (see Examples Section, Example II) 

25 The set of duplexes was analyzed to identify correlations between siRNA 

functionality and other biophysical or thermodynamic properties. When the siRNA 
panel was analyzed in functional and non-functional subgroups, certain nucleotides 
were much more abundant at certain positions in functional or non-functional groups. 
More specifically, the frequency of each nucleotide at each position in highly 

30 functional siRNA duplexes was compared with that of nonfunctional duplexes in 
order to assess the preference for or against any given nucleotide at every position. 
These analyses were used to determine important criteria to be included in the siRNA 
algorithms (Formulas VIII and IX). 
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The data set was also analyzed for distinguishing biophysical properties of 
siRNAs in the functional group, such as optimal percent of GC content, propensity for 
internal structures and regional thermodynamic stability. Of the presented criteria, 
several are involved in duplex recognition, RISC activation/duplex unwinding, and 
5 target cleavage catalysis. 



The original data set that was the source of the statistically derived criteria is 
shown in Figure 2. Additionally, this figure shows that random selection yields 
siRNA duplexes with unpredictable and widely varying silencing potencies as 

10 measured in tissue culture using HEK293 cells. In the figure, duplexes are plotted 

such that each x-axis tick-mark represents an individual siRNA, with each subsequent 
siRNA differing in target position by two nucleotides for Human Cyclophilin and 
Firefly Luciferase, and by one nucleotide for Human DBI. Furthermore, the y-axis 
denotes the level of target expression remaining after transfection of the duplex into 

1 5 cells and subsequent silencing of the target. 

SiRNA identified and optimized in this document work equally well in a wide 
range of cell types. Figure 3a shows the evaluation of thirty siRNAs targeting the 
DBI gene in three cell lines derived from different tissues. Each DBI siRNA displays 

20 very similar functionality in HEK293 (ATCC, CRI>1 573, human embryonic kidney), 
HeLa (ATCC, CCL-2, cervical epithelial adenocarcinoma) and DU145 (HTB-81, 
prostate) cells as deterimined by the B-DNA assay. Thus, siRNA functionality is 
* determined by the primary sequence of the siRNA and not by the intracellular 
environment. Additionally, it should be noted that although the present invention 

25 provides for a determination of the functionality of siRNA for a given target, the same 
siRNA may silence more than one gene. For example, the complementary sequence 
of the silencing siRNA may be present in more than one gene. Accordingly, in these 
circumstances, it may be desirable not to use the siRNA with highest SMARTscore™. 
In such circumstances, it may be desirable to use the siRNA with the next highest 

30 SMARTscore™. 



To determine the relevance of G/C content in siRNA function, the G/C content 
of each duplex in the panel was calculated and the functional classes of siRNAs 
(<F50, > F50, > F80, > F95 where F refers to the percent gene silencing) were sorted 
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accordingly. The majority of the highly-functional siRNAs (>F95) fell within the 
G/C content range of 36%— 52% (Figure 3B). Twice as many non-functional (< 
F50) duplexes fell within the high G/C content groups (>57% GC content) compared 
to the 36%— 52% group. The group with extremely low GC content (26% or less) 

5 contained a higher proportion of non-functional siRNAs and no highly-functional 
siRNAs. The G/C content range of 30%— 52% was therefore selected as Criterion I 
for siRNA functionality, consistent with the observation that a G/C range 30% — 70% 
promotes efficient RNAi targeting. Application of this criterion alone provided only a 
marginal increase in the probability of selecting functional siRNAs from the panel: 

10 selection of F50 and F95 siRNAs was improved by 3.6% and 2.2%, respectively. The 
siRNA panel presented here permitted a more systematic analysis and quantification 
of the importance of this criterion than that used previously. 



A relative measure of local internal stability is the A/U base pair (bp) content; 

1 5 therefore, the frequency of A/U bp was determined for each of the five terminal 
positions of the duplex (5' sense (S)/5' antisense (AS)) of all siRNAs in the panel. 
Duplexes were then categorized by the number of A/U bp in positions 1 — 5 and 1 5 — 
19 of the sense strand. The thermodynamic flexibility of the duplex 5'-end (positions 
1 — 5 ; s) did not appear to correlate appreciably with silencing potency, while that of 

20 the 3 '-end (positions 15—19; S) correlated with efficient silencing. No duplexes 

lacking A/U bp in positions 15—19 were functional. The presence of one A/U bp in 
this region conferred some degree of functionality, but the presence of three or more 
A/Us:was preferable and therefore defined as Criterion II. When applied to the test 
panel, only a marginal increase in the probability of functional siRNA selection was 

25 achieved: a 1.8% and 2.3% increase for F50 and F95 duplexes, respectively (Table 
IV). 



The complementary strands of siRNAs that contain internal repeats or 
palindromes may form internal fold-back structures. These hairpin-like structures 
30 exist in equilibrium with the duplexed form effectively reducing the concentration of 
functional duplexes. The propensity to form internal hairpins and their relative 
stability can be estimated by predicted melting temperatures. High Tm reflects a 
tendency to form hairpin structures. Lower Tm values indicate a lesser tendency to 
form hairpins. When the functional classes of siRNAs were sorted by T,„ (Figure 3c), 
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the following trends were identified: duplexes lacking stable internal repeats were the 
most potent silencers (no F95 duplex with predicted hairpin structure T m > 60 °C). In 
contrast, about 60% of the duplexes in the groups having internal hairpins with 
calculated T m values less than 20 °C were F80. Thus, the stability of internal repeats 
5 is inversely proportional to the silencing effect and defines Criterion III (predicted 
hairpin structure T m < 20 °C). 

Sequence-based determinants of siRNA functionality 

When the siRNA panel was sorted into functional and non-functional groups, 

10 the frequency of a specific nucleotide at each position in a functional siRNA duplex 
was compared with that of a nonfunctional duplex in order to assess the preference for 
or against a certain nucleotide. Figure 4 shows the results of these queries and the 
subsequent resorting of the data set (from Figure 2). The data is separated into two 
sets: those duplexes that meet the criteria, a specific nucleotide in a certain position - 

1 5 grouped on the left (Selected) and those that do not - grouped on the right 

(Eliminated). The duplexes are further sorted from most functional to least functional 
with the y-axis of Figure 4a-e representing the % expression i.e. the amount of 
silencing that is elicited by the duplex (Note: each position on the X-axis represents a 
different duplex). Statistical analysis revealed correlations between silencing and 

20 several sequence-related properties of siRNAs. Figure-4~and Table IV show 

quantitative analysis for the following five sequence-related properties of siRNA: (A) 
an A at position 19 of the sense strand; (B) an A at position 3 of the sense strand; (C) 
a U at position 1 0 of the sense strand; (D) a base other than G at position 1 3 of the 
sense strand; and (E) a base other than C at position 19 of the sense strand. 

25 

When the siRNAs in the panel were evaluated for the presence of an A at 
position 19 of the sense strand, the percentage of non-functional duplexes decreased 
from 20% to 1 1 .8%, and the percentage of F95 duplexes increased from 21 .7% to 
29.4% (Table IV). Thus, the presence of an A in this position defined Criterion IV. 

30 

Another sequence-related property correlated with silencing was the presence 
of an A in position 3 of the sense strand (Figure 4b). Of the siRNAs with A3, 34.4% 
were F95, compared with 21 .7% randomly selected siRNAs. The presence of a U 
base in position 1 0 of the sense strand exhibited an even greater impact (Figure 4c). 
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Of the duplexes in this group, 41 .7% were F95. These properties became criteria V 
and VI, respectively. 



Two negative sequence-related criteria that were identified also appear on 
5 Figure 4. The absence of a G at position 13 of the sense strand, conferred a marginal 
increase in selecting functional duplexes (Figure 4d). Similarly, lack of a C at 
position 19 of the sense strand also correlated with functionality (Figure 4e). Thus, 
among functional duplexes, position 19 was most likely occupied by A, and rarely 
occupied by C. These rules were defined as criteria VII and VIII, respectively. 

10 

* 

Application of each criterion individually provided marginal but statistically 
significant increases in the probability of selecting a potent siRNA. Although the 
results were informative, the inventors sought to maximize potency and therefore 
consider multiple criteria or parameters. Optimization is particularly important when 
1 5 developing therapeutics. Interestingly, the probability of selecting a functional siRNA 
based on each thermodynamic criteria was 2% — 4% higher than random, but 4% — 
8% higher for the sequence-related determinates. Presumably, these sequence-related 
increases reflect the complexity of the RNAi mechanism and the multitude of protein- 
RNA interactions that are involved in RNAi-mediated silencing. 
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V. An A base at position 3 
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of the sense strand 


> P^n 
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6.4% 




>F95 


34.4% 


12.7% 


VI. A U base at position 10 


<F50 


13.9% 


-6.1% 


of the sense strand 


>F50 


86.1% 


■ 6.1% 




>F80 


69.4% 


13.3% 




>F95 . 
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VII. A base other than C at 


<F50 


18.8% 


-1.2% 


position 19 of the sense strand 


x >F50 
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1.2% 




>F80 


59.7% 


3.6% 




>F95 


24.2% 


2.5% 


VIII. A base other than G at 


<F50 


15.2% 


-4.8% 


position 1 3 of the sense strand 


>F50 


84.8% 


4.8% 




> F80 


61.4% 


5.3% 




>F95 


26.5% 


4.8% 



25 

The siRNA selection algorithm 

In an effort to improve selection further, all identified criteria, including but 
not limited to those listed in Table IV were combined into the algorithms embodied in 
Formula VIII and Formula EK. Each siRNA was then assigned a score (referred to as 
30 a SMARTscore™) according to the values derived from the formulas.- Duplexes that 
scored higher than 0 or 20, for Formulas VIII and IX, respectively, effectively 
selected a set of functional siRNAs and excluded all non-functional siRNAs. 
Conversely, all duplexes scoring lower than 0 and 20 according to formulas VIII and 
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IX, respectively, contained some functional siRNAs but included aU non-functional 
siRNAs. A graphical representation of this selection is shown in Figure 5. 



The methods for obtaining the seven criteria embodied in Table IV are 
5 illustrative of the results of the process used to develop the information for Formulas 
VIII and IX. Thus similar techniques were used to establish the other variables and 
their multipliers. As described above, basic statistical methods were use to determine 
the relative values for these multipliers. 

10 To determine the value for "Improvement over Random" the difference in the 

frequency of a given attribute (e.g. GC content, base preference) at a particular 
position is determined between individual functional groups (e.g. <F50) and the total 
siRNA population studied (e.g. 270 siRNA molecules selected randomly). Thus, for 
instance, in Criterion I (30%-52% GC content) members of the <F50 group were 

1 5 observed to have GC contents between 30-52% in 16.4% of the cases. In contrast, the 
total group of 270 siRNAs had GC contents in this range, 20% of the time. Thus for 
this particular attribute, there is a small negative correlation between 30%-52% GC 
content and this functional group (i.e. 16.4%-20% = -3.6%). Similarly, for Criterion 
VI, (a "U" at position 10 of the sense strand), the >F95 group contained a "U" at this 

20 position 41 .7% of the time. In contrast, the total group of 270 siRNAs had a "U" at 
this position 21.7% of the time, thus the improvement over random is calculated to be 
20%(or4L7%-21.7%). 

Identifying The Average Internal Stability Profile of Strong siRNA 
25 In order to identify an internal stability profile that is characteristic of strong 

siRNA, 270 different siRNAs derived from the cyclophilin B, the diazepam binding 
inhibitor (DBI), and the luciferase gene were individually transfected into HEK293 
cells and tested for their ability to induce RNAi of the respective gene. Based on their 
performance in the in vivo assay, the sequences were then subdivided into three 
30 groups, (i) >95% silencing; (ii) 80-95% silencing; and (iii) less than 50% silencing. 
Sequences exhibiting 51-84% silencing were eliminated from further consideration to 
reduce the difficulties in identifying relevant thermodynamic patterns. 
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Following the division of siRNA into three groups, a statistical analysis was 
performed on each member of each group to determine the average internal stability 
profile (AISP) of the siRNA. To accomplish this the Oligo 5.0 Primer Analysis 
Software and other related statistical packages (e.g. Excel) were exploited to 

5 determine the internal stability of pentamers using the nearest neighbor method 

described by Freier et aL, (1986) Improved free-energy parameters for predictions of 
RNA duplex stability, Proc Natl. Acad. Sci. U. S. A. 83(24): 9373-7. Values for each 
group at each position were then averaged, and the resulting data were graphed on a 
linear coordinate system with the Y-axis expressing the AG (free energy) values in 

10 kcal/mole and the X-axis identifying the position of the base relative to the 5' end. 

The results of the analysis identified multiple key regions in siRNA molecules 
that were critical for successful gene silencing. At the 3 5 -most end of the sense strand 
(5'antisense), highly functional siRNA (>95% gene silencing, see Figure 6a, >F95) 

1 5 have a low internal stability (AISP of position 19 = ~ -7.6kcal/mol). In contrast low- 
efficiency siRNA (i.e. those exhibiting less than 50% silencing, <F50) display a 
distinctly different profile, having high AG values (~ -8.4kcal/mol) for the same 
position. Moving in a 5' (sense strand) direction, the internal stability of highly 
efficient siRNA rises (position 12 = ~ ~8.3kcal/mole) and then drops again (position 7 

20 = ~ -7.7kcal/mol) before leveling off at a value of approximately -8.1 kcal/mol for the 
5' terminus. SiRNA with poor silencing capabilities show a distinctly different 
profile. While the AISP value at position 12 is nearly identical with that of strong 
siRNAs, the values at positions 7 and 8 rise considerably, peaking at a high of ~ -9.0 
kcal/mol. In addition, at the 5' end of the molecule the AISP profile of strong and 

25 weak siRNA differ dramatically. Unlike the relatively strong values exhibited by 

siRNA in the >95% silencing group, siRNAs that exhibit poor silencing activity have 
weak AISP values (-7.6, -7.5, and -7.5 kcal/mol for positions 1, 2 and 3 respectively). 

Overall the profiles of both strong and weak siRNAs form distinct sinusoidal 
30 shapes that are roughly 180° out-of-phase with each other. While these 

thermodynamic descriptions define the archetypal profile of a strong siRNA, it will 
likely be the case that neither the AG values given for key positions in the profile or 
the absolute position of the profile along the Y-axis (i.e. the AG -axis) are absolutes. 
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Profiles that are shifted upward or downward (i.e. having on an average, higher or 
lower values at every position) but retain the relative shape and position of the profile 
along the X-axis can be foreseen as being equally effective as the model profile 
described here. Moreover, it is likely that siRNA that have strong or even stronger 

5 gene-specific silencing effects might have exaggerated AG values (either higher or 
lower) at key positions. Thus, for instance, it is possible that the 5 '-most position of 
the sense strand (position 19) could have AG values of 7.4 kcal/mol or lower and still 
be a strong siRNA if, for instance, a G-C -> G-T/U mismatch were substituted at 
position 19 and altered duplex stability. Similarly, position 12 and position 7 could 

10 have values above 8.3 kcal/mol and below 7.7 kcal/mole, respectively, without 

abating the silencing effectiveness of the molecule. Thus, for instance, at position 12, 
a stabilizing chemical modification (e.g. a chemical modification of the 2 s position of 
the sugar backbone) could be added that increases the average internal stability at that 
position. Similarly, at position 7, mismatches similar to those described previously 

1 5 could be introduced that would lower the AG values at that position. 



Lastly, it is important to note that while functional and non-functional siRNA 
were originally defined as those molecules having specific silencing properties, both 
broader or more limiting parameters can be used to define these molecules. As used 

20 herein, unless otherwise specified, "non-functional siRNA" are defined as those 

siRNA that induce less than 50% (<50%) target silencing, "semi-functional siRNA" 
induce 50-79% target silencing, "functional siRNA" are molecules that induce 80- 
95% gene silencing, and ^Mghly-functional siRNA" are molecules that induce great 
than 95% gene silencing. These definitions are not intended to be rigid and can vary 

25 depending upon the design and needs of the application. For instance, it is possible 
that a researcher attempting to map a gene to a chromosome using a functional assay, 
may identify an siRNA that reduces gene activity by only 30%. While this level of 
gene silencing may be "non- functional" for e.g. therapeutic needs, it is sufficient for 
gene mapping purposes and is, under these uses and conditions, "functional." For 

30 these reasons, functional siRNA can be defined as those molecules having greater 
than 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% silencing capabilities at 
lOOnM transfection conditions. Similarly, depending upon the needs of the study 
and/or application, non-functional and semi-functional siRNA can be defined as 
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having different parameters. For instance, semi-functional siRNA can be defined as 
being those molecules that induce 20%, 30%, 40%, 50%, 60%, or 70% silencing at 
lOOnM transfection conditions. Similarly, non-functional siRNA can be defined as 
being those molecules that silence gene expression by less than 70%, 60%, 50%, 
5 40%, 30%, or less. Nonetheless, unless otherwise stated, the descriptions stated in 
the "Definitions" section of this text should be applied. 



Functional attributes can be assigned to each of the key positions in the AISP 
of strong siRNA. The low 5' (sense strand) AISP values of strong siRNAs may be 
10 necessary for determining which end of the molecule enters the RISC complex. In 

contrast, the high and low AISP values observed in the central regions of the molecule 
may be critical for siRNA-target mRNA interactions and product release, 
respectively. 

1 5 If the AISP values described above accurately define the thermodynamic 

parameters of strong siRNA, it would be expected that similar patterns would be 
observed in strong siRNA isolated from nature. Natural siRNAs exist in a harsh, 
RNase-rich environment and it can be hypothesized that only those siRNA that 
exhibit heightened affinity for RISC {i.e. siRNA that exhibit an average internal 

20 stability profile similar to those observed in strong siRNA) would survive in an.-, 
intracellular environment. This hypothesis was tested using GFP-specific siRNA 
isolated from K benthamiana. Llave et al (2002) Endogenous and Silencing- 
Associated Small RNAs in Plants, She Plant Cell 14, 1605-1619, introduced long 
double-stranded GFP-encoding RNA into plants and subsequently re-isolated GFP- 

25 specific siRNA from the tissues. The AISP of fifty-nine of these GFP-siRNA were 
determined, averaged, and subsequently plotted alongside the AISP profile obtained 
from the cyclophilin B/DBI/ luciferase siRNA having >90% silencing properties 
(Figure 6b). Comparison of the two groups show that profiles are nearly identical. 
This finding validates the information provided by the internal stability profiles and 

30 demonstrates that: (1) the profile identified by analysis of the cyclophilin B/DBI/ 

luciferase siRNAs are not gene specific; and (2) AISP values can be used to search for 
strong siRNAs in a variety of species. 
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Both chemical modifications and base-pair mismatches can be incorporated 
into siRNA to alter the duplex's AISP and functionality. For instance, introduction of 
mismatches at positions 1 or 2 of the sense strand destabilized the 5'end of the sense 
strand and increases the functionality of the molecule (see Luc, Figure 7). Similarly, 
5 addition of 2 5 -0-methyl groups to positions 1 and 2 of the sense strand can also alter 
the AISP and (as a result) increase both the functionality of the molecule and 
eliminate off-target effects that results from sense strand homology with the unrelated 
targets (Figures 8a, 8b). 

10 Rationale for Criteria in a Biological Context 

The fate of siRNA in the RNAi pathway may be described in 5 major steps: 
(1) duplex recognition and pre-RISC complex formation; (2) ATP-dependent duplex 
unwinding/strand selection and RISC activation; (3) mRNA target identification; (4) 
mRNA cleavage, and (5) product release (Figure 1). Given the level of nucleic acid- 

1 5 protein interactions at each step, siRNA functionality is likely influenced by specific 
biophysical and molecular properties that promote efficient interactions within the 
context of the multi-component complexes. Indeed, the systematic analysis of the 
siRNA test set identified multiple factors that correlate well with functionality. When 
combined into a single algorithm, they proved to be very effective in selecting active 

20 siRNAs. 

The factors described here may also be predictive of key functional 
associations important for each stepinJRNAi. For example, the potential formation of 
internal hairpin structures correlated negatively with siRNA functionality. 

25 Complementary strands with stable internal repeats are more likely to exist as stable 
hairpins thus decreasing the effective concentration of the functional duplex form. 
This suggests that the duplex is the preferred conformation for initial pre-RISC 
association. Indeed, although single complementary strands can induce gene . 
silencing, the effective concentration required is at least two orders of magnitude 

30 higher than that of the duplex form. 

siRNA-pre-RISC complex formation is followed by an ATP-dependent duplex 
unwinding step and "activation" of the RISC. The siRNA functionality was shown to 
correlate with overall low internal stability of the duplex and low internal stability of 
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the 3 1 sense end (or differential internal stability of the 3' sense compare to the 5' 
sense strand), which may reflect strand selection and entry into the RISC. Overall 
duplex stability and low internal stability at the 3' end of the sense strand were also 
correlated with siRNA functionality. Interestingly, siRNAs with very high and very 
5 low overall stability profiles correlate strongly with non-functional duplexes. One 
interpretation is that high internal stability prevents efficient unwinding while very 
low stability reduces siRNA target affinity and subsequent mRNA cleavage by the 
RISC. 

10 Several criteria describe base preferences at specific positions of the sense 

strand and are even more intriguing when considering their potential mechanistic 
roles in target recognition^and mRNA cleavage. Base preferences for A at position 19 
of the sense strand but not C, are particularly interesting because they reflect the same 
base preferences observed for naturally occurring miRNA precursors. That is, among 

15 the reported miRNA precursor sequences 75% contain a U at position 1 which 

corresponds to an A in position 19 of the sense strand of siRNAs, while G was under- 
represented in this same position for miRNA precursors. These observations support 
the hypothesis that both miRNA precursors and siRNA duplexes are processed by 
very similar if not identical protein machinery. The functional interpretation of the 

20- predominance of a U/A base pair is that it promotes flexibility at the 5 'antisense ends ' 
of both siRNA duplexes and miRNA precursors and facilitates efficient unwinding 
and selective strand entrance into an activated RISC. 

Among the criteria associated with base preferences that are likely to 
25 influence mRNA cleavage or possibly product release, the preference for U at 
position 10 of the sense strand exhibited the greatest impact, enhancing the 
probability of selecting an F80 sequence by 13.3%. Activated RISC preferentially 
cleaves target mRNA between nucleotides 10 and 1 1 relative to the 5' end of the 
complementary targeting strand. Therefore, it may be that U, the preferred base for 
30 most endoribonucleases, at this position supports more efficient cleavage. 

Alternatively, a U/A bp between the targeting siRNA strand and its cognate target 
mRNA may create an optimal conformation for the RISC-associated "slicing" 
activity. 
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Pooling 

According to another embodiment, the present invention provides a pool of at 
least two siRNAs, preferably in the form of a kit or therapeutic reagent, wherein one 
strand of each of the siRNAs, the sense strand comprises a sequence that is 
5 substantially similar to a sequence within a target mRNA. The opposite strand, the 
antisense strand, will preferably comprise a sequence that is substantially 
complementary to that of the target mRNA. More preferably, one strand of each 
siRNA will comprise a sequence that is identical to a sequence that is contained in the 
target mRNA. Most preferably, each siRNA will be 1 9 base pairs in length, and one 
10 strand of each of the siRNAs will be 100% complementary to a portion of the target 
mRNA. 

By increasing the number of siRNAs directed to a particular target using a 
pool or kit, one is able both to increase the likelihood that at least one siRNA with 

1 5 satisfactory functionality will be included, as well as to benefit from additive or 

synergistic effects. Further, when two or more siRNAs directed against a single gene 
do not have satisfactory levels of functionality alone, if combined, they may 
satisfactorily promote degradation of the target messenger RNA and successfully 
inhibit translation. By including multiple siRNAs in the system, not only is the 

20 - probability of silencing increased, but the economics of operation are also improved 
when compared to adding different siRNAs sequentially. This effect is contrary to the 
conventional wisdom that the concurrent use of multiple siRNA will negatively 
impact gene silencing (e.g. Holen, T.,efea/.*(2-Q03) "Similar behavior of single strand 
and double strand siRNAs suggests they act through a common RNAi pathway." 

25 NAR 31: 2401-21407). 

In fact, when two siRNAs were pooled together, 54% of the pools of two 
siRNAs induced more than 95% gene silencing. Thus, a 2.5-fold increase in the 
percentage of functionality was achieved by randomly combining two siRNAs. 
30 Further, over 84% of pools containing two siRNAs induced more than 80% gene 
silencing. 

More preferably, the kit is comprised of at least three siRNAs, wherein one 
strand of each siRNA comprises a sequence that is substantially similar to a sequence 
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of the target mRNA and the other strand comprises a sequence that is substantially 
complementary to the region of the target mRNA. As with the kit that comprises at 
least two siRNAs, more preferably one strand will comprise a sequence that is 
identical to a sequence that is contained in the mRNA and another strand that is 100% 
5 complementary to a sequence that is contained in the mRNA. During experiments, 
when three siRNAs were combined together, 60% of the pools induced more than 
95% gene silencing and 92% of the pools induced more than 80% gene silencing. 

Further, even more preferably, the kit is comprised of at least four siRNAs, 
wherein one strand of each siRNA comprises a sequence that is substantially similar 
to a region of the sequence of the target mRNA, and the other strand comprises a 
sequence that is substantially complementary to the region of the target mRNA. As 
with the kit or pool that comprises at least two siRNAs, more preferably one strand of 
each of the siRNA duplexes will comprise a sequence that is identical to a sequence 
that is contained in the mRNA, and another strand that is 100% complementary to a 
sequence that is contained in the mRNA. 

Additionally, kits and pools with at least five, at least six, and at least seven 
siRNAs may also be useful with the present invention. For example, pools of five 
siRNA induced 95% gene silencing with 77% probability and 80% silencing with 
98.8% probability. Thus, pooling of siRNAs together can result in the creation of a 
target-specific silencing reagent with almost a 99% probability of being functional. 
The fact that such high levels of success are achievable using such pools of siRNA, 
enables one to dispense with costly and time-consuming target-specific validation 
procedures. 

For this embodiment, as well as the other aforementioned embodiments, each 
of the siRNAs within a pool will preferably comprise between 18 and 30 base pairs, 
more preferably between 1 8 and 25 base pairs, and most preferably 1 9 base pairs. 
30 Within each siRNA, preferably at least 18 contiguous bases of the antisense strand 
will be 100% complementary to the target mRNA. More preferably, at least 19 
contiguous bases of the antisense strand will be 1 00% complementary to the target 
mRNA. Additionally, there may be overhangs on either the sense strand or the 
antisense strand, and these overhangs may be at either the 5' end or the 3 ' end of 
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either of the strands, for example there may be one or more overhangs of 1 -6 bases. 
When overhangs are present, they are not included in the calculation of the number of 
base pairs. The two nucleotide 3' overhangs mimic natural siRNAs and are 
commonly used but are not essential. Preferably, the overhangs should consist of two 

5 nucleotides, most often dTdT or UU at the 3 ' end of the sense and antisense strand 
that are not complementary to the target sequence. The siRNAs may be produced by 
any method that is now known or that comes to be known for synthesizing double 
stranded RNA that one skilled in the art would appreciate would be useful in the 
present invention. Preferably, the siRNAs will be produced by Dharmacon's 

1 0 proprietary ACE® technology. However, other methods for synthesizing siRNAs are 
well known to persons skilled in the art and include, but are not limited to, any 
chemical synthesis of RNA oligonucleotides, ligation of shorter oligonucleotides, in 
vitro transcription of RNA oligonucleotides, the use of vectors for expression within 
cells, recombinant Dicer products and PCR products. 

15 

The siRNA duplexes within the aforementioned pools of siRNAs may 
correspond to overlapping sequences within a particular mRNA, or non-overlapping 
sequences of the mRNA. However, preferably they correspond to non-overlapping 
sequences. Further, each siRNA may be selected randomly, or one or more of the 
20 siRNA ihay be selected according to the criteria discussed above for maximizing the 
effectiveness of siRNA. 

Included in the definition of siRNAs are siRNAs that contain substituted 
and/or labeled nucleotides that may, for example, be labeled by radioactivity, 
25 fluorescence or mass. The most common substitutions are at the T position of the 

ribose sugar, where moieties such as H (hydrogen) F, NH3, OCH3 and other O- alkyl, 
alkenyl, alkynyl, and orthoesters, may be substituted, or in the phosphorous backbone, 
• where sulfur, amines or hydrocarbons may be substituted for the bridging of non- 
bridging atomsjn the phosphodiester bond. Examples of modified siRNAs are 
30 explained more fully in commonly assigned U.S. Patent Application Ser. No. 
10/613,077, filed July 1 , 2003, which is incorporated by reference herein. 

Additionally, as noted above, the cell type into which the siRNA is introduced 
may affect the ability of the siRNA to enter the cell; however, it does not appear to 
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affect the ability of the siRNA to function once it enters the cell. Methods for 
introducing double-stranded RNA into various cell types are well known to persons 
skilled in the art. 



5 As persons skilled in the art are aware, in certain species, the presence of 

proteins such as RdRP, the RNA-dependent RNA polymerase, may catalytically 
enhance the activity of the siRNA. For example, RdRP propagates the RNAi effect in 
C elegans and other non-mammalian organisms. In fact, in organisms that contain 
these proteins, the siRNA may be inherited. Two other proteins that are well studied 
1 0 and known to be a part of the machinery are members of the Argonaute family and 
Dicer, as well as their homologues. There is also initial evidence that the RISC 
complex might be associated with the ribosome so the more efficiently translated 
mRNAs will be more susceptible to silencing than others. 

15 Another very important factor in the efficacy of siRNA is mRNA localization. 

In general, only cytoplasmic mRNAs are considered to be accessible to RNAi to any 
appreciable degree. However, appropriately designed siRNAs, for example, siRNAs 
modified with internucleotide linkages, may be able to cause silencing by acting in the 
nucleus. Examples of these types of modifications are described in commonly 

20 assigned U.S. Patent Application Serial Nos. 10/431,027 and 10/613,077, each of . 
which is incorporated by reference herein. 

As described above, even when one select&atleasttwo siRNAs at random, the 
effectiveness of the two may be greater than one would predict based on the 

25 effectiveness of two individual siRNAs. This additive or synergistic effect is 
particularly noticeable as one increases to at least three siRNAs, and even more 
noticeable as one moves to at least four siRNAs. Surprisingly, the pooling of the non- 
functional and semi-functional siRNAs, particularly more than five siRNAs, can lead 
to a silencing mixture that is as effective if not more effective than any one particular 

30 functional siRNA. 
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Within the kit of the present invention, preferably each siRNA will be present 
in a concentration of between 0.001 and 200 |nM, more preferably between 0.01 and 
200 nM, and most preferably between 0. 1 and 1 0 nM. 

5 In addition to preferably comprising at least four or five siRNAs, the kit of the 

present invention will also preferably comprise a buffer to keep the siRNA duplex 
stable. Persons skilled in the art are aware of buffers suitable for keeping siRNA 
stable. For example, the buffer may be comprised of 1 00 mM KC1, 30 mM HEPES- 
pH 7.5, and 1 mM MgC^. Alternatively, kits might contain complementary strands 
10 that contain any one of a number of chemical modifications (e.g. a 2 5 -O- ACE) that 
protect the agents from degradation by nucleases. In this instance, the user may (or 
may not) remove the modifying protective group (e.g. deprotect) before annealing the 
two complementary strands together. 

15 By way of example, the kit may be organized such that pools of siRNA 

duplexes are provided on an array or microarray of wells or drops for a particular 
gene set or for unrelated genes. The array may, for example, be in 96 wells, 384 wells 
or 1284 wells arrayed in a plastic plate or on a glass slide using techniques now 
known or that come to be known to persons skilled in the art. Within an array, 

20 preferably there^will be controls such as functional anti-lamin A/C, cyclophilin and 
two siRNA duplexes that are not specific to the gene of interest. 

In order to ensure stability of the siRNA pootepriar-to^usage, they may be 
retained in lyophilized form at minus twenty degrees (— 20°C) until they are ready for 

25 use. Prior to usage, they should be resuspended; however, even once resuspended, for 
example, in the aforementioned buffer, they should be kept at minus twenty degrees, 
(— 20°C) until used. The aforementioned buffer, prior to use, may be stored at 
approximately 4°C or room temperature. Effective temperatures at which to conduct 
transfections are well known to persons skilled in the art and include for example, 

30 room temperature. 

The kit may be applied either in vivo or in vitro. Preferably, the siRNA of the 
pools or kits is applied to a cell through transfection, employing standard transfection 
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protocols. These methods are well known to persons skilled in the art and include the 
use of lipid-based carriers, electroporation, cationic carriers, and microinjection. 
Further, one could apply the present invention by synthesizing equivalent DNA 
sequences (either as two separate, complementary strands, or as hairpin molecules) 
5 instead of siRNA sequences and introducing them into cells through vectors. Once in 
the cells, the cloned DNA could be transcribed, thereby forcing the cells to generate 
the siRNA. Examples of vectors suitable for use with the present application include 
but are not limited to the standard transient expression vectors, adenoviruses, 
retroviruses, lentivirus-based vectors, as well as other traditional expression vectors. 
10 Any vector that has an adequate siRNA expression and procession module may be 
used. Furthermore, certain chemical modifications to siRNAs, including but not 
limited to conjugations to other molecules, may be used to facilitate delivery. For 
certain applications it may be preferable to deliver molecules without transfection by 
simply formulating in a physiological acceptable solution. 

15 

This embodiment may be used in connection with any of the aforementioned 
embodiments. Accordingly, the sequences within any pool may be selected by 
rational design. 



20 

Multigene Silencing 

In addition to developing kits that contain multiple siRNA directed against a 
single gene, another embodiment includes the use of multiple*sjRiJA targeting 
multiple genes. Multiple genes may be targeted through the use of high- or hyper- 

25 functional siRNA. High- or hyper- functional siRNA that exhibit increased potency, 
require lower concentrations to induce desired phenotypic (and thus therapeutic) 
effects. This circumvents RISC saturation. It therefore reasons that if lower 
concentrations of a single siRNA are needed for knockout or knockdown expression 
of one gene, then the remaining (uncomplexed) RISC will be free and available to 

30 interact with siRNA directed against two, three, four, or more, genes. Thus in this 
embodiment, the authors describe the use of highly functional or hyper-functional 
siRNA to knock out three separate genes. More preferably, such reagents could be 
combined to knockout four distinct genes. Even more preferably, highly functional or 
hyperfunctional siRNA could be used to knock out five distinct genes. Most 



WO 2004/045543 PCT/US2003/036787 

53 

preferably, siRNA of this type could be used to knockout or knockdown the 
expression of six or more genes. 



Hyperfiinctional siRNA 
5 The term hyperfiinctional siRNA (hf-siRNA) describes a subset of the siRNA 

population that induces RNAi in cells at low- or sub-nanomolar concentrations for 
extended periods of time. These traits, heightened potency and extended longevity of 
the RNAi phenotype, are highly attractive from a therapeutic standpoint. Agents 
having higher potency require lesser amounts of the molecule to achieve the desired 
1 0 physiological response, thus reducing the probability of side effects due to "off- 
target" interference. In addition to the potential therapeutic benefits associated with 
hyperfiinctional siRNA, hf-siRNA are also desirable^rom an economic position. 
Hyperfiinctional siRNA may cost less on a per-treatment basis, thus reducing the 
overall expenditures to both the manufacturer and the consumer. 

1 5 Identification of hyperfiinctional siRNA involves multiple steps that are 

designed to examine an individual siRNA agent's concentration- and/or longevity- 
profiles. In one non-limiting example, a population of siRNA directed against a single 
gene are first analyzed using the previously described algorithm (Formula VIII). 
Individual siRNA are then introduced into a test cell line and assessed for the ability 

20 to degrade the target mRNA. It is important to note that when performing this step it 
is not necessary to test all of the siRNA. Instead, it is sufficient to test only those 
siRNA having the highest SMARTscores™ {i.e. SMARTscore™ > -10). 
Subsequently, the gene silencing data is plotted against the SMARTscores™ (see 
Figure 9). SiRNA that (1) induce a high degree of gene silencing (i.e. they induce 

25 greater than 80% gene knockdown) and (2) have superior SMARTscores™ (i.e. a 

SMARTscore™ of > -10, suggesting a desirable average internal stability profile) are 
selected for further investigations designed to better understand the molecule's 
potency and longevity. In one, non-limiting study dedicated to understanding a 
molecule's potency, an siRNA is introduced into one (or more) cell types in 

30 increasingly diminishing concentrations (e.g. 3.0 0.3 nM). Subsequently, the level 
of gene silencing induced by each concentration is examined and siRNA that exhibit 
hyperfiinctional potency (i.e. those that induce 80% silencing or greater at e.g. 
picomolar concentrations) are identified. In a second study, the longevity profiles of 
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siRNA having high (>-l 0) SMARTscores™ and greater than 80% silencing are 
examined. In one non-limiting example of how this is achieved, siRNA are introduced 
into a test cell line and the levels of RNAi are measured over an extended period of 
time (e.g. 24-168 hrs). SiRNAs that exhibit strong RNA interference patterns (i.e. 

5 >80 % interference) for periods of time greater than, e.g., 1 20 hours are thus 

identified. Studies similar to those described above can be performed on any and all 
of the >10 6 siRNA included in this document to further define the most functional 
molecule for any given gene. Molecules possessing one or both properties (extended 
longevity and heightened potency) are labeled "hyperfunctional siRNA," and 

1 0 earmarked as candidates for future therapeutic studies. 

While the example(s) given above describe one means by which 
hyperfunctional siRNA can be isolated, neither the assays themselves nor the 
selection parameters used are rigid and can vary with each family of siRNA. Families 
15 of siRNA include siRNAs directed against a single gene, or directed against a related 
family of genes. 



The highest quality siRNA achievable for any given gene may vary 
considerably. Thus, for example, in the case of one gene (gene X), rigorous studies 

20 such as those described above may enable the identification of an siRNA that, at 
picomolar concentrations, induces 99 + % silencing for a period of 10 days. Yet 
identical studies of a second gene (gene Y) may yield an siRNA that at high 
nanomolar concentrations (e.g. lOOnM) induces only 75% silencing fo^period of 2 
days. Both molecules represent the very optimum siRNA for their respective gene 

25 targets and therefore are designated "hyperfunctional." Yet due to a variety of factors 
including but not limited to target concentration, siRNA stability, cell type, off-target 
interference, and others, equivalent levels of potency and longevity are not 
achievable. Thus, for these reasons, the parameters described in the before mentioned 
assays, can vary. While the initial screen selected siRNA that had SMARTscores™ 

30 above -10 and a gene silencing capability of greater than 80%, selections that have 
stronger (or weaker) parameters can be implemented. Similarly, in the subsequent 
studies designed to identify molecules with high potency and longevity, the desired 
cutoff criteria (i.e. the lowest concentration that induces a desirable level of 
interference, or the longest period of time that interference can be observed) can vary. 
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The experimentation subsequent to application of the rational criteria of this 
application is significantly reduced where one is trying to obtain a suitable 
hyperfunctional siRNA for, for example, therapeutic use. When, for example, the 
additional experimentation of the type described herein is applied by one skilled in the 
5 art with this disclosure in hand, a hyperfunctional siRNA is readily identified. 



The siRNA may be introduced into a cell by any method that is now known or 
that comes to be known and that from reading this disclosure, persons skilled in the 
art would determine would be useful in connection with the present invention in 

10 enabling siRNA to cross the cellular membrane. These methods include, but are not 
limited to, any manner of transfection, such as for example transfection employing 
DEAE-Dextran, calcium phosphate, cationic lipids/liposomes-, micelles, manipulation 
of pressure, microinjection, electroporation, immunoporation, use of vectors such as 
viruses, plasmids, cosmids, bacteriophages, cell fusions, and coupling of the 

1 5 polynucleotides to specific conjugates or ligands such as antibodies, antigens, or 

receptors, passive introduction, adding moieties to the siRNA that facilitate its uptake, 
and the like. 

Having described the invention with a degree of particularity, examples will 
20 now be provided. These examples are not intended to and should not be construed to 
limit the scope of the claims in any way. 



Examples ...... 

25 General Techniques and Nomenclatures 

siRNA nomenclature. All siRNA duplexes are referred to by sense strand. The first 
nucleotide of the 5 '-end of the sense strand is position 1, which corresponds to 
position 1 9 of the antisense strand for a 1 9-mer. In most cases, to compare results 
from different experiments, silencing was determined by measuring specific transcript 
30 mRNA levels or enzymatic activity associated with specific transcript levels, 24 hours 
post-transfection, with siRNA concentrations held constant at 100 nM. For all 
experiments, unelss otherwise specified transfection efficiency was ensured to be over 
95%, and no detectable cellular toxicity was observed. The following system of 
nomenclature was used to compare and report siRNA-silencing functionality: "F" 
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followed by the degree of minimal knockdown. For example, F50 signifies at least 
50% knockdown, F80 means at least 80%, and so forth. For this study, all sub-F50 
siRNAs were considered non-functional. 



5 Cell culture and transfection. 96-well plates are coated with 50 jil of 50 mg/ml poly- 
L-lysine (Sigma) for 1 hr, and then washed 3X with distilled water before being dried 
for 20 min. HEK293 cells or HEK293Lucs or any other cell type of interest are 
released from their solid support by trypsinization, diluted to 3.5 X 10 5 cells/ml, 
followed by the addition of 100 \iL of cells/well. Plates are then incubated overnight 

10 at 37° C, 5% CO2. Transfection procedures can vary widely depending on the cell 
type and transfection reagents. In one non-limiting example, a transfection mixture 
consisting of 2 mL Opti-MEM I (Gibco-BRL), 80 \xl LipofectamTne 2000 
(Invitrogen), 15 |xL SUPERNasin at 20 U/fxl (Ambion), and 1.5 jlxI of reporter gene 
plasmid at 1 \xg/ \xl is prepared in 5-ml polystyrene round bottom tubes. 100 pi of 

1 5 transfection reagent is then combined with 100 \xl of siRNAs in polystyrene deep-well 
titer plates (Beckman) and incubated for 20 to 30 min at room temp. 550 pi of Opti- 
MEM is then added to each well to bring the final siRNA concentration to 100 nM. 
Plates are then sealed with parafilm and mixed. Media is removed from HEK293 
cells and replaced with 95 pi of transfection mixture. Cells are incubated overnight at 

20 37° C, 5% C0 2 . 



Quantification of gene knockdown. A variety of quantification procedures can be 
used to measure the level of silencing induced by siRNA or siRNA pools. In one non- 
limiting example: to measure mRNA levels 24 hrs post-transfection, QuantiGene 
25 branched-DNA (bDNA) kits (Bayer) (Wang, et al f Regulation of insulin preRNA 
splicing by glucose. Proc Natl Acad Sci 1997, 94:4360.) are used according to 
manufacturer instructions. To measure luciferase activity, media is removed from 
HEK293 cells 24 hrs post-transfection, and 50 |_tl of Steady-GLO reagent (Promega) 
is added. After 5 min, plates are analyzed on a plate reader. 

30 

Example I. Sequences Used to Develop the Algorithm. 

Anti-Firefly and anti-Cyclophilin siRNAs panels (Figure 5a, b) sorted 
according to using Formula VIII predicted values. All siRNAs scoring more than 0 



WO 2004/045543 PCT7US2003/036787 

57 

(formula VIII) and more then 20 (formula IX) are fully functional. All ninety 
sequences for each gene (and DBI) appear below in Table III. 



5 TABLE III 



Cyclo 


1 


SEQ. ID 0032 


GUUCCAAAAACAGUGGAUA 


Cyclo 


2 


SEQ. ID 0033 


UCCAAAAACAGUGGAUAAU 


Cyclo 


3 


SEQ. ID 0034 


CAAAAACAGUGGAUAAUUU 


Cyclo 


4 


SEQ. ID 0035 


AAAACAGUGGAUAAUUUUG 


Cyclo 


5 


SEQ. ID 0036 


AACAGUGGAUAAUUUUGUG 


Cyclo 


6 


SEQ. ID 0037 


CAGUGGAUAAUUUUGUGGC 


Cyclo 


7 


SEQ. ID 0038 


GUGGAUAAUUUUGUGGCCU 


Cyclo 


8 


SEQ. ID 0039 


GGAUAAUUUUGUGGCCUUA 


Cyclo • 


9 


SEQ. ID 0040 


AUAAUUUUGUGGCCUUAGC 


Cyclo 


10 


SEQ. ID 0041 


AAUUUUGUGGCCUUAGCUA 


Cyclo 


11 


SEQ. ID 0042 


UUUUGUGGCCUUAGCUACA 


Cyclo 


12 


SEQ. ID 0043 


UUGUGGCCUUAGCUACAGG 


Cyclo 


13 


SEQ. ID 0044 


GUGGCCUUAGCUACAGGAG 


Cyclo 


14 


SEQ. ID 0045 


GGCCUUAGCUACAGGAGAG 


Cyclo 


15 


SEQ. ID 0046 


CCUUAGCUACAGGAGAGAA 


Cyclo 


16 


SEQ. ID 0047 


UUAGCUACAGGAGAGAAAG 


Cyclo 


17 


SEQ. ID 0048 


AGCUACAGGAGAGAAAGGA 


Cyclo 


18 


SEQ, ID 0049 


CUACAGGAGAGAAAGGAUU 


Cyclo 


19 


SEQ. ID 0050 


ACAGGAGAGAAAGGAUUUG 


Cyclo 


20 


SEQ. ID 0051 


AGGAGAGAAAGGAUUUGGC 


Cyclo 


21 


SEQ. ID 0052 


GAGAGAAAGGAUUUGGCUA 


Cyclo 


22 


SEQ. ID 0053 


GAGAAAGGAUUUGGCUACA 


Cyclo 


23 


SEQ. ID 0054 


GAAAGGAUUUGGCUACAAA 


Cyclo 


24 


SEQ. ID 0055 


AAGGAUUUGGCUACAAAAA 


Cyclo 


25 


SEQ. ID 0056 


GGAUUUGGCUACAAAAACA 


Cyclo 


26 


SEQ. ID 0057 


AUUUGG CUAC AAAAAC AGC 


Cyclo 


27 


SEQ. ID 0058 


UUGGCUAC AAAAAC AG C AA 


Cyclo 


28 


SEQ. ID 0059 


GGCUACAAAAACAGCAAAU 


Cyclo 


29 


SEQ. ID 0060 


CUACAAAAACAGCAAAUUC 


Cyclo 


30 


SEQ. ID 0061 


ACAAAAACAGCAAAUUCCA 


Cyclo 


31 


SEQ. ID 0062 


AAAAACAGCAAAUUCCAUC 


Cyclo 


32 


SEQ. ID 0063 


AAACAGCAAAUUCCAUCGU 


Cyclo 


33 


SEQ. ID 0064 


ACAGCAAAUUCCAUCGUGU 


Cyclo 


34 


SEQ. ID 0065 


AG C AAAUUCC AUCGUGU AA 
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Cyclo 


35 


SEQ. ED 0066 


CAAAUUCCAUCGUGUAAUC 


Cyclo 


36 


SEQ. ID 0067 


AAUUCCAUCGUGUAAUCAA 


Cyclo 


37 


SEQ. ID 0068 


UUCCAUCGUGUAAUCAAGG 


Cyclo 


38 


SEQ. ID 0069 


CCAUCGUGUAAUCAAGGAC 


Cyclo 


39 


SEQ. ID 0070 


AUCGUGUAAUCAAGGACUU 


Cyclo 


40 


SEQ. ID 0071 . 


CGUGUAAUCAAGGACUUCA 


Cyclo 


41 


SEQ. ID 0072 


UGUAAUCAAGGACUUCAUG 


Cyclo 


42 


SEQ. ID 0073 


UAAUCAAGGACUUCAUGAU 


Cyclo 


43 


SEQ. ID 0074 


AUCAAGGACUUCAUGAUCC 


Cyclo 


44 


SEQ. ID 0075 


. CAAGGACUUCAUGAUCCAG 


Cyclo 


45 


SEQ. ID 0076 


AGGACUUCAUGAUCCAGGG 


Cyclo 


•46 


SEQ. ID 0077 


GACUUCAUGAUCCAGGGCG 


Cyclo 


47 


SEQ. ID 0078 


CUUCAUGAUCCAGGGCGGA 


Cyclo 


48 


SEQ. ID 0079 


UCAUGAUCCAGGGCGGAGA 


Cyclo 


49 


SEO ID 0080 


AUGAUCCAGGGCGGAGACU 


Cyclo 


50 


SEQ. ID 008 1 


GAUCCAGGGCGGAGACUUC 


Cyclo 


51 


SEQ. ID 0082 


UCCAGGGCGGAGACUUCAC 


Cyclo 


52 


SEQ. ID 0083 


CAGGGCGGAGACUUCACCA 


Cyclo 


53 


SEQ. ID 0084 


GGGCGGAGACUUCACCAGG 


Cyclo 


54 


SEQ. ID 0085 


GCGGAGACUUCACCAGGGG 


Cyclo 


55 


SEQ. ID 0086 


GGAGACUUCACCAGGGGAG 


Cyclo 


56 


SEQ. ID 0087 


AGACUUCACCAGGGGAGAU 


Cyclo 


57 


SEQ. ID 0088 


: ACUUCACCAGGGGAGAUGG 


Cyclo 


58 


SEQ. ID 0089 


UUCACCAGGGGAGAUGGCA 


Cyclo 


59 


SEQ. ID 0090 


CACCAGGGGAGAUGGCACA 


Cyclo 


60 


SEQ. ID 0091 


CCAGGGGAGAUGGCACAGG 


Cyclo 


61 


SEQ. ID 0092 


AGGGGAGAUGGCACAGGAG 


Cyclo 


62 


SEQ. ID 0093 


GGGAGAUGGCACAGGAGGA 


Cyclo 


63 


SEQ. ID 0094 


GAGAUGGCACAGGAGGAAA 


Cyclo 


64 


SEQ. ID 0095 


GAUGGCACAGGAGGAAAGA 


Cyclo 


65 


SEQ. ID 0094 


UGGCACAGGAGGAAAGAGC 


Cyclo 


66 


SEQ. ID 0096 


GCACAGGAGGAAAGAGCAU 


Cyclo 


67 


SEQ. ID 0097 


ACAGGAGGAAAGAGCAUCU 


Cyclo 


68 


SEQ. ID 0098 


AGGAGGAAAGAGCAUCUAC 


Cyclo 


69 


SEQ. ID 0099 


GAGGAAAGAGCAUCUACGG 


Cyclo 


70 


SEQ. ID 0100 


GGAAAGAGCAUCUACGGUG 


Cyclo 


71 


SEQ. ID 0101 


AAAGAGCAUCUACGGUGAG 


Cyclo 


72 


SEQ. ID 0102 


AGAGCAUCUACGGUGAGCG 


Cyclo 


73 


SEQ. ID 0103 


AGCAUCUACGGUGAGCGCU 
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Cyclo 


74 


SEQ. ID 0104 


Cyclo 


75 


SEQ.ID0105 


Cyclo 


76 


SEQ. ID 0106 


Cyclo 


77 


SEQ. ID 0107 


Cyclo 


78 


SEQ. ID 0108 


Cyclo 


79 


SEQ. ID 0109 


Cyclo 


80 


SEQ. ID 0110 


Cyclo 


81 


SEQ. ID 0111 


Cyclo 


82 


SEQ. ID 0112 


Cyclo 


83 


SEQ. ID 01 13 


Cyclo 


84 


SEQ. ID 01 14 


Cyclo 


85 


SEQ. ID 01 15 


Cyclo 


86 


SEQ. ED 01 16 


Cyclo 


87 


SEQ. ID 0117 


Cyclo 


88 


SEQ. ED 0118 


Cyclo 


89 


SEQ. ID 0119 


Cyclo 


90 


SEQ. ID 0120 


DB 


1 


SEQ. ID 0121 


DB 


2 


SEQ. ED 0122 


DB 


3 


SEQ. ID 0123 


DB 


4 


SEQ. ID 0124 


DB 


5 


SEQ. ID 0125 


DB 


6 


SEQ. ID 0126 


DB 


7 


SEQ. ID 0127 


DB 


8 


SEQ. ID 0128 


DB 


9 . 


SEQ. ID 0129 


DB 


10 


SEQ. ID 0130 


DB 


11 


SEQ. ID 0131 


DB 


12 


SEQ. ID 0132 


DB 


13 


SEQ. ID 0133 


DB 


14 


SEQ. ED 0134 


DB 


15 


SEQ. ID 0135 


DB 


16 


SEQ. ID 0136 


DB 


17 


SEQ. ID 0137 


DB 


18 


SEQ. ID 0138 


DB 


19 


SEQ. ID 0139 


DB 


20 


SEQ. 3D 0140 


DB 


21 


SEQ. ED 0141 


DB 


22 


SEQ. ED 0142 


DB 


23 


SEQ. ID 0143 
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CAUCUACGGUGAGCGCUUC 

UCUACGGUGAGCGCUUCCC 

UACGGUGAGCGCUUCCCCG 

CGGUGAGCGCUUCCCCGAU 

GUGAGCGCUUCCCCGAUGA 

GAGCGCUUCCCCGAUGAGA 

GCGCUUCCCCGAUGAGAAC 

GCUUCCCCGAUGAGAACUU 

UUCCCCGAUGAGAACUUCA 

CCCCGAUGAGAACUUCAAA 

CCGAUGAGAACUUCAAACU 

GAUGAGAACUUCAAACUGA 

UGAGAACUUCAAACUGAAG 

AGAACUUCAAACUGAAGCA . 

AACUUCAAACUGAAGCACU 

CUUCAAACUGAAGCACUAC 

UCAAACUGAAGCACUACGG 

ACGGGCAAGGCCAAGUGGG 

CGGGCAAGGCCAAGUGGGA 

GGGCAAGGCCAAGUGGGAU 

GGCAAGGCCAAGUGGGAUG 

GCAAGGCCAAGUGGGAUGC 

CAAGGCCAAGUGGGAUGCC 

AAGGCCAAGUGGGAUGCCU 

AGGCCAAGUGGGAUGCCUG 

GGCCAAGUGGGAUGCCUGG 

GCCAAGUGGGAUGCCUGGA 

CCAAGUGGGAUGCCUGGAA 

CAAGUGGGAUGCCUGGAAU 

AAGUGGGAUGCCUGGAAUG 

AGUGGGAUGCCUGGAAUGA 

GUGGGAUGCCUGGAAUGAG 

UGGGAUGCCUGGAAUGAGC 

GGGAUGCCUGGAAUGAGCU 

GG AUG CCUGG AAUG AGCUG 

GAUGCCUGGAAUGAGCUGA 

AUGCCUGGAAUGAGCUGAA 

UGCCUGG AAUG AGCUG AAA 

GCCUGGAAUGAGCUGAAAG 

CCUG G AAUG AG CUG AAAGG 
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DB 


24 


SEQ. ID 0144 


CUGGAAUGAGCUGAAAGGG 


DB 


25 


SEQ. ID 0145 


UG G AAUG AGCUGAAAGG G A 


DB 


26 


SEQ. ID 0146 


GGAAUGAGCUGAAAGGGAC 


DB 


27 


SEQ. ID 0147 


GAAUGAGCUGAAAGGGACU 


DB 


28 


SEQ. ID 0148 


AAUGAGCUGAAAGGGACUU 


DB 


29 


SEQ. ID 0149 


AUGAGCUGAAAGGGACUUC 


DB 


30 


SEQ. ID 0150 


UGAGCUGAAAGGGACUUCC 


DB 


31 


SEQ. ID 0151 


GAGCUGAAAGGGACUUCCA 


DB 


32 


SEQ. ID 0152 


AGCUGAAAGGGACUUCCAA 


DB 


33 


SEQ. ID 0153 


GCUGAAAGGGACUUCCAAG 


DB 


34 


SEQ. E) 0154 


CUGAAAGGGACUUCCAAGG 


DB 


35 


SEQ. ID 0155 


UGAAAGGGACUUCCAAGGA 


DB 


36 


SEQ. ID 0156 


GAAAGGGACUUCCAAGGAA 


DB 


37 


SEQ. ID 0157 


AAAGGGACUUCCAAGGAAG 


DB 


38 


SEQ. ID 0158 


AAGGGACUUCCAAGGAAGA 


DB 


39 


SEQ. ID 0159 


AGGGACUUCCAAGGAAGAU 


DB 


40 


SEQ. ID 0160 


G GG ACUUCCAAGG AAG AUG 


DB 


41 


SEQ. ID 0161 


GGACUUCCAAGGAAGAUGC 


DB 


42 


SEQ. ID 0162 


GACUUCCAAGGAAGAUGCC 


DB 


43 


SEQ. ID 0163 


ACUUCCAAGGAAGAUGCCA 


DB 


• 

44 


SEQ. ID 0164 


CUUCCAAGGAAGAUGCCAU 


DB 


45 


SEQ. ID 0165 


UUCCAAGGAAGAUGCCAUG 


DB 


46 


SEQ. ID 0166 


UCCAAGGAAGAUGCCAUGA 


DB 


47 


SEQ. ID 0167 


CGAAGGAAGAUGCCAUGAA 


DB 


48 


SEQ. ID 0168 


CAAGGAAGAUGCCAUGAAA 


DB 


49 


SEQ. ID 0169 


AAGGAAGAUGCCAUGAAAG 


DB 


50 


SEQ. ID 0170 


AGGAAGAUGCCAUGAAAGC 


DB 


51 


SEQ. ID 0171 


GGAAGAUGCCAUGAAAGCU 


DB 


52 


SEQ. ID 0172 


GAAGAUGCCAUGAAAGCUU 


DB 


53 


SEQ. ID 0173 


AAGAUGCCAUGAAAGCUUA 


DB 


54 


SEQ. ID 0174 


AGAUGCCAUGAAAGCUUAC 


DB 


55 


SEQ. ID 0175 


GAUGCCAUGAAAGCUUACA 


DB 


56 


SEQ. ID 0176 


AUGCCAUGAAAGCUUACAU 


DB 


57 


SEQ. ID 0177 


UGCCAUGAAAGCUUACAUC 


DB - 


58 


SEQ. ID 0178 


GCCAUGAAAGCUUACAUCA 


DB 


59 


SEQ. ID 0179 


CCAUGAAAGCUUACAUCAA 


DB 


60 


SEQ. ID 0180 


CAUGAAAGCUUACAUCAAC 


DB 


61 


SEQ. ID 0181 


AUGAAAGCUUACAUCAACA 


DB 


62 


SEQ. ID 0182 


UG AAAG CUU AC AUG AAC AA 


DB 


63 


SEQ. ID 0183 


GAAAGCUUACAUCAACAAA 
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DB 


64 


SEQ. ID 0184 


AAAGCUUACAUCAACAAAG 


DB 


65 


SEQ. ID 0185 


AAGCUUACAUCAACAAAGU 


DB 


66 


SEQ.ID0186 


AGCUUACAUCAACAAAGUA 


DB 


67 


SEQ. ID 0187 


GCUUACAUCAACAAAGUAG 


DB 


68 


SEQ. ED 0188 


CUUACAUCAACAAAGUAGA 


DB 


69 


SEQ.ID0189 


UUACAUCAACAAAGUAGAA 


DB 


70 


SEQ. ID 0190 


UACAUCAACAAAGUAGAAG 


DB 


71 


SEQ.ID0191 


ACAUCAACAAAGUAGAAGA 


DB 


72 


SEQ. ID 0192 


CAUCAACAAAGUAGAAGAG 


DB 


73 


SEQ. ED 0193 


AUCAACAAAGUAGAAGAGC 


DB 


74 


SEQ. ID 0194 


UCAACAAAGUAGAAGAGCU 


DB 


75 


SEQ. ID 0195 


CAACAAAGUAGAAGAGCUA 


DB 


76 


SEQ. ID 0196 


AACAAAGUAGAAGAGCUAA 


DB 


77 


SEQ. ID 0197 


ACAAAGUAGAAGAGCUAAA 


DB 


78 


SEQ. ID 0198 


CAAAGUAGAAGAGCUAAAG 


DB 


79 


SEQ. ID 0199 


AAAGUAGAAGAGCUAAAGA 


DB 


80 


SEQ. ID 0200 


AAGUAGAAGAGCUAAAGAA 


DB 


81 


SEQ. ID 0201 


AGUAGAAGAGCUAAAGAAA 


DB 


82 


SEQ. ID 0202 


GUAGAAGAGCUAAAGAAAA 


DB 


83 


SEQ. ID 0203 


UAGAAGAGCUAAAGAAAAA 


DB 


84 


SEQ. ID 0204 


AGAAGAGCUAAAGAAAAAA 


DB 


85 


SEQ. ID 0205 


GAAGAGCUAAAGAAAAAAU 


DB 


86 


SEQ. ID 0206 


AAG AG CU AAAG AAAAAAUA 


DB 


87 


SEQ. ID 0207 


AGAGCUAAAGAAAAAAUAC 


DB 


88 


SEQ. ID 0208 


GAGCUAAAGAAAAAAUACG 


DB 


89 


SEQ. ID 0209 


AGCUAAAGAAAAAAUACGG 


DB 


90 


SEQ. ID 0210 


GCUAAAGAAAAAAUACGGG 


Luc * 


1 


SEQ. ID 021 1 


AUCCUCAUAAAGGCCAAGA 


Luc 


2 


SEQ. ID 0212 


AGAUCCUCAUAAAGGCCAA 


Luc 


3 


SEQ. ID 0213 


AGAGAUCCUCAUAAAGGCC 


Luc 


4 


SEQ. ED 0214 


AGAGAGAUCCUCAUAAAGG 


Luc 


5 


SEQ. ID 0215 


UCAGAGAGAUCCUCAUAAA 


Luc 


6 


SEQ. ID 0216 


AAUCAGAGAGAUCCUCAUA 


Luc 


7 


SEQ. ID 0217 


AAAAUCAGAGAGAUCCUCA 


Luc 


8 


SEQ. ID 0218 


GAAAAAUCAGAGAGAUCCU 


Luc 


9 


SEQ. ID 0219 


AAGAAAAAUCAGAGAGAUC 


Luc 


10 


SEQ. ID 0220 


GCAAGAAAAAUCAGAGAGA 


Luc 


11 


SEQ. ED 0221 


ACGCAAGAAAAAUCAGAGA 


Luc 


12 


SEQ. ID 0222 


CGACGCAAGAAAAAUCAGA 


Luc 


13 


SEQ. ID 0223 


CUCGACGCAAGAAAAAUCA 
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Luc 


14 


SEQ. ID 0224 


AACUCGACGCAAGAAAAAU 


Luc 


15 


SEQ. ID 0225 


AAAACUCGACGCAAGAAAA 


Luc 


16 


SEQ. ID 0226 


GGAAAACUCGACGCAAGAA 


Luc 


17 


SEQ. ID 0227 


CCGGAAAACUCGACGCAAG 


Luc 


18 


SEQ. ID 0228 


UACCGGAAAACUCGACGCA 


Luc 


19 


SEQ. ID 0229 


CUUACCGGAAAACUCGACG 


Luc 


20 


SEQ. ID 0230 


GUCUUACCGGAAAACUCGA 


Luc 


21 


SEQ. ID 0231 


AGGUCUUACCGGAAAACUC 


Luc 


22 


SEQ. ID 0232 


AAAGGUCUUACCGGAAAAC 


Luc 


23 


SEQ. ED 0233 


CGAAAGGUCUUACCGGAAA 


Luc 


24 


SEQ. ID 0234 


ACCGAAAGGUCUUACCGGA 


Luc 


25 


SEQ. ID 0235 


GUACCGAAAGGUCUUACCG 


Luc 


26 


SEQ. ID 0236 


AAGUACCGAAAGGUCUUAC 


Luc 


27 


SEQ. 3D 0237 


CGAAGUACCGAAAGGUCUU 


Luc 


28 


SEQ. ID 0238 


GACGAAGUACCGAAAGGUC 


Luc 


29 


SEQ. ID 0239 


UGGACGAAGUACCGAAAGG 


Luc 


30 


SEQ. ID 0240 


UGUGGACGAAGUACCGAAA 


Luc 


31 


SEQ. ID 0241 


UUUGUG G ACG AAGUACCG A 


Luc 


32 


SEQ. ID 0242 


UGUUUGUGGACGAAGUACC 


Luc 


33 


SEQ. ID 0243 


UGUGUUUGUGGACGAAGUA 


Luc 


34 


SEQ. ID 0244 


GUUGUGUUUGUGGACGAAG 


Luc 


35 


SEQ. ID 0245 


GAGUUGUGUUUGUGGACGA 


Luc 


36 


SEQ. ID 0246 


AGGAGUUGUGUUUGUGGAC 


Luc 


37 


SEQ. ID 0247 


GGAGGAGUUGUGUUUGUGG 


Luc 


38 


SEQ. ID 0248 


GCGGAGGAGUUGUGUUUGU 


Luc 


39 


SEQ. ID 0249 


GCGCGGAGGAGUUGUGUUU 


Luc 


40 


SEQ. ID 0250 


UUGCGCGGAGGAGUUGUGU 


Luc 


41 


SEQ. ID 0251 


AGUUGCGCGGAGGAGUUGU 


Luc 


42 


SEQ. ID 0252 


AAAGUUGCGCGGAGGAGUU 


Luc 


43 


SEQ. ID 0253 


AAAAAGUUGCGCGGAGGAG 


Luc 


44 


SEQ. ID 0254 


CGAAAAAGUUGCGCGGAGG 


Luc 


45 


SEQ. ID 0255 


CGCGAAAAAGUUGCGCGGA 


Luc 


46 


SEQ. ID 0256 


ACCGCGAAAAAGUUGCGCG 


Luc 


47 


SEQ. ID 0257 


CAACCGCGAAAAAGUUGCG 


Luc 


48 


SEQ. ID 0258 


AACAACCGCGAAAAAGUUG 


Luc 


49 


SEQ. ID 0259 


GUAACAACCGCGAAAAAGU 


Luc 


50 


SEQ. ID 0260 


AAGUAAC AACCG CG AAAAA 


Luc 


51 


SEQ. ID 0261 


UCAAGUAACAACCGCGAAA 


Luc 


52 


SEQ. ID 0262 


AGUCAAGUAACAACCGCGA 


Luc 


53 


SEQ. ID 0263 


CCAGUCAAGUAACAACCGC 
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Luc 


54 


SEQ. ID 0264 


CGCCAGUCAAGUAACAACC 


Luc 


55 


SEQ. ID 0265 


GUCGCCAGUCAAGUAACAA 


Luc 


56 


SEQ. ID 0266 


ACGUCGCCAGUCAAGUAAC 


Luc 


57 


SEQ. ID 0267 


UUACGUCGCCAGUCAAGUA 


Luc 


58 


SEQ. ID 0268 


GAUUACGUCGCCAGUCAAG 


Luc 


59 


SEQ. ID 0269 


UGGAUUACGUCGCCAGUCA 


Luc 


60 


SEQ. ID 0270 


CGUGGAUUACGUCGCCAGU 


Luc 


61 


SEQ. ID 0271 


AUCGUGGAUUACGUCGCCA 


Luc 


62 


SEQ. ID 0272 


AGAUCGUGGAUUACGUCGC 


Luc 


63 


SEQ. ID 0273 


AGAGAUCGUGGAUUACGUC 


Luc 


64 


SEQ. ID 0274 


AAAGAGAUCGUGGAUUACG 


Luc 


65 


SEQ. ID 0275 


AAAAAGAGAUCGUGGAUUA 


Luc 


66 


SEQ. ID 0276 


GGAAAAAGAGAUCGUGGAU 


Luc 


67 


SEQ. ID" 0277 


ACGGAAAAAGAGAUCGUGG 


Luc 


68 


SEQ. ID 0278 


UGACGGAAAAAGAGAUCGU 


Luc 


. 69 


SEQ. ID 0279 


GAUGACGGAAAAAGAGAUC 


Luc 


70 


SEQ. ID 0280 


ACGAUGACGGAAAAAGAGA 


Luc 


71 


SEQ. ID 0281 


AGACGAUGACGGAAAAAGA 


Luc 


72 


SEQ. ID 0282 


AAAGACGAUGACGGAAAAA 


Luc 


73 


SEQ. ID 0283 


GGAAAGACGAUGACGGAAA 


Luc 


74 


SEQ. ID 0284 


ACGGAAAGACGAUGACGGA 


Luc 


75 


SEQ. ID 0285 


GCACGGAAAGACGAUGACG 


Luc 


76 


SEQ. ID 0286 


GAGCACGGAAAGACGAUGA 


Luc 


77 


SEQ. ID 0287 


UGGAGCACGGAAAGACGAU 


Luc 


78 


SEQ. ID 0288 


UUUGGAGCACGGAAAGACG 


Luc 


79 


SEQ. ID 0289 


GUUUUGGAGCACGGAAAGA 


Luc 


80 


SEQ. ID 0290 


UUGUUUUGGAGCACGGAAA 


Luc 


81 


SEQ. ID 0291 


UGUUGUUUUGGAGCACGGA 


Luc 


82 


SEQ. ID 0292 


GUUGUUGUUUUGGAGCACG 


Luc 


83 


SEQ. ID 0293 


CCGUUGUUGUUUUGGAGCA 


Luc 


84 


SEQ. ID 0294 


CGCCGUUGUUGUUUUGGAG 


Luc 


85 


SEQ. ID 0295 


GCCGCCGUUGUUGUUUUGG 


Luc 


86 


SEQ. ID 0296 


CCGCCGCCGUUGUUGUUUU 


Luc 


87 


SEQ. ID 0297 


UCCCGCCGCCGUUGUUGUU 


Luc 


88 


SEQ. ID 0298 


CUUCCCGCCGCCGUUGUUG 


Luc 


89 


SEQ. ID 0299 


AACUUCCCGCCGCCGUUGU 


Luc 


90 


SEQ. ID 0300 


UGAACUUCCCGCCGCCGUU 
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Example EL Validation of the Algorithm using DBI, Luciferase, PLK, EGFR, 
and SEAP 

The algorithm (Formula VIII) identified siRNAs for five genes, human DBI, 
firefly luciferase (fLuc), renilla luciferase (rLuc), human PLK, and human secreted 
5 alkaline phosphatase (SEAP). Four individual siRNAs were selected on the basis of 
their SMARTscores™ derived by analysis of their sequence using Formula VIII (all 
of the siRNAs would be selected with Formula IX as well) and analyzed for their 
ability to silence their targets' expression. In addition to the scoring, a BLAST search 
was conducted for each siRNA. To minimize the potential for off-target silencing 

1 0 effects, only those target sequences with more than three mismatches against un- 
related sequences were selected. Semizarov, et al, Specificity of short interfering UNA 
determined through gene expression signatures. Proc. Natl. Acad. Sci. U.S.A. 2003, 
100:6347. These duplexes were analyzed individually and in pools of 4 and 
compared with several siRNAs that were randomly selected. The functionality was 

1 5 measured a percentage of targeted gene knockdown as compared to controls. All 

siRNAs were transfected as described by the methods above at 100 nM concentration 
into HEK293 using Lipofectamine 2000. The level of the targeted gene expression 
was evaluated by B-DNA as described above and normalized to the non-specific 
control. Figure 10 shows that the siRNAs selected by the algorithm disclosed herein 

20 were significantly more potent than randomly selected siRNAs. The algorithm 

increased the chances of identifying an F50 siRNA from 48% to 91%, and an F80 
siRNA from 13% to 57%. In addition, pools of SMART siRNA silence the selected 
target better than randomly selected pools (see Figure 1 OF). 

25 Example III. Validation of the Algorithm Using Genes Involved in Clathrin- 
Dependent Endocytosis. 

Components of clathrin-mediated endocytosis pathway are key to modulating 
intracellular signaling and play important roles in disease. Chromosomal 
rearrangements that result in fusion transcripts between the Mixed-Lineage Leukemia 
30 gene (MLL) and CALM (Clathrin assembly lymphoid myeloid leukemia gene) are 
believed to play a role in leukemogenesis. Similarly, disruptions in Rab7 and Rab9, 
as well as HIP1 (Huntingtin-interacting protein), genes that are believed to be 
involved in endocytosis, are potentially responsible for ailments resulting in lipid 
storage, and neuronal diseases, respectively. For these reasons, siRNA directed 
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against clathrin and other genes involved in the clathrin-mediated endocytotic 
pathway are potentially important research and therapeutic tools. 

siRNAs directed against genes involved in the clathrin-mediated endocytosis 
5 pathways were selected using Formula VIII. The targeted genes were clathrin heavy 
chain (CHC, accession # NMJ)04859), clathrin light chain A (CLCa, NM_001 833), 
clathrin light chain B (CLCb, NM_001 834), CALM (U45976), (32 subunit of AP-2 
(P2 5 NM_001282), EpslS (NM_001981), EpslSR (NM_021235), dynamin II 
(DYNII, NMJ)04945), Rab5a (BC001267), Rab5b (NM__002868), Rab5c 
10 (AF141304), and EEA.l (XM_018197). 



"For each gene, four siRNAs duplexes with the highest scores were selected 
and a BLAST search was conducted for each of them using the Human EST database. 
In order to minimize the potential for off-target silencing effects, only those 
1 5 sequences with more than three mismatches against un-related sequences were used. 
All duplexes were synthesized at Dharmacon, Inc. as 21-mers with 3'-UU overhangs 
using a modified method of 2'-ACE chemistry Scaringe, Advanced 5'-$ilyl-2'- 
orthoester approach to RNA oligonucleotide synthesis, Methods Enzymol 2000, 317:3 
and the antisense strand was chemically phosphorylated to insure maximized activity. 

20 

HeLa cells were grown in Dulbecco's modified Eagle's medium (DMEM) 
containing 10% fetal bovine serum, antibiotics and glutamine. siRNA duplexes were 
resuspended in IX siRNA Universal buffer (Dharmacon, Inc.) to 20|iM prior to 
transfection. HeLa cells in 12- well plates were transfected twice with 4jal of 20pM 

25 siRNA duplex in 3jxl Lipofectamine 2000 reagent (Invitrogen, Carlsbad, California, 
USA) at 24-hour intervals. For the transfections in which 2 or 3 siRNA duplexes were 
included, the amount of each duplex was decreased, so that the total amount was the 
same as in transfections with single siRNAs. Cells were plated into normal culture 
medium 12 hours prior to experiments, and protein levels were measured 2 or 4 days 

30 after the first transfection. 

Equal amounts of lysates were resolved by electrophoresis, blotted, and 
stained with the antibody specific to targeted protein, as well as antibodies specific to 
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unrelated proteins, PP1 phosphatase and TsglOl (not shown). The cells were lysed in 
Triton X-100/glycerol solubilization buffer as described previously. Tebar, 
Bohlander, & Sorkin, Clathrin Assembly Lymphoid Myeloid Leukemia (CALM) 
Protein: Localization in Endocytic-coated Pits, Interactions with Clathrin, and the 
5 Impact of Overexpression on Clathrin-mediated Traffic, Mol. Biol. Cell Aug 1999, 
10:2687. Cell lysates were electrophoresed, transferred to nitrocellulose membranes, 
and Western blotting was performed with several antibodies followed by detection 
using enhanced chemiluminescence system (Pierce, Inc). Several x-ray films were 
analyzed to determine the linear range of the chemiluminescence signals, and the 
1 0 quantifications were performed using densitometry and Alphalmager v5.5 software 
(Alpha Innotech Corporation). In experiments with Epsl5R-targeted siRNAs, cell 
lysates were subjected to immunoprecipitation with Ab860, and EpslSR was detected 
in immunoprecipitates by Western blotting as described above. 



1 5 The antibodies to assess the levels of each protein by Western blot were 

obtained from the following sources: monoclonal antibody to clathrin heavy chain 
(TD.l) was obtained from American Type Culture Collection (Rockville, MD, USA); 
polyclonal antibody to dynamin II was obtained from Affinity Bioreagents, Inc. 
(Golden, CO, USA); monoclonal antibodies to EEA.l and RabSa were purchased 

20 from BD Transduction Laboratories (Los Angeles, CA, USA); the monoclonal 

antibody to TsglOl was purchased from Santa Cruz Biotechnology, Inc. (Santa Cruz, 
CA, USA); the monoclonal antibody to GFP was from ZYMED Laboratories Inc. 
(South San Francisco* CA^USA); the rabbit polyclonal antibodies Ab32 specific to a- 
adaptins and Ab20 to CALM were described previously Sorkin, et al f Stoichiometric 

25 Interaction of the Epidermal Growth Factor Receptor with the Clathrin-associated 
Protein Complex AP-2, J. Biol. Chem. Jan 1995, 270:619, the polyclonal antibodies to 
clathrin light chains A and B were kindly provided by Dr. F. Brodsky (UCSF); 
monoclonal antibodies to PP 1 (BD Transduction Laboratories) and a- Actinin 
(Chemicon) were kindly provided by Dr. M. Dell'Acqua (University of Colorado); 

30 Epsl5 Ab577 and EpslSR Ab860 were kindly provided by Dr. P.P. Di Fiore 
(European Cancer Institute). 
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Figure 11 demonstrates the in vivo functionality of 48 individual siRNAs, 
selected using Formula VIII (most of them will meet the criteria incorporated by 
Formula DC as well) targeting 12 genes. Various cell lines were transfected with 
siRNA duplexes (Dupl-4) or pools of siRNA duplexes (Pool), and the cells were 
5 lysed 3 days after transfection with the exception of CALM (2 days) and P2 (4 days). 

Note a pi-adaptin band (part of AP-1 Golgi adaptor complex) that runs 
slightly slower than £2 adaptin. CALM has two splice variants, 66 and 72 kD. The 
full-length Epsl5R (a doublet of -130 kD) and several truncated spliced forms of ~ 

10 100 kD and -70 kD were detected in Epsl5R immunoprecipitates (shown by arrows). 
The cells were lysed 3 days after transfection. Equal amounts of lysates were 
resolved by electrophoresis and blotted with the antibody specific to a targeted protein 
(GFP antibody for YFP fusion proteins) and the antibody specific to unrelated 
proteins PP1 phosphatase or a-actinin, and TSG101. The amount of protein in each 

1 5 specific band was normalized to the amount of non-specific proteins in each lane of 
the gel. Nearly all of them appear to be functional, which establishes that Formula 
VIII and IX can be used to predict siRNAs' functionality in general in a genome wide 
manner. 

20 To generate the fusion of yellow fluorescent protein (YFP) with RabSb or 

Rab5c (YFP-Rab5b or YFP-Rab5c), a DNA fragment encoding the full-length human 
RabSb or RabSc was obtained by PCR using Pfu polymerase (Stratagene) with a Sad 
restriction site introduced iiitS^the 5 ' end and a Kpnl site into the 3 ' end and cloned 
into pEYFP-Cl vector (CLONTECH, Palo Alto, CA, USA). GFP-CALM and YFP- 

25 RabSa were described previously Tebar, Bohlander, & Sorkin, Clathrin Assembly 

Lymphoid Myeloid Leukemia (CALM) Protein: Localization in Endocytic-coated Pits, 
Interactions with Clathrin, and the Impact of Overexpression on Clathrin-mediated 
Traffic, Mol. Biol. Cell Aug 1999, 10:2687. 

30 Example IIL Validation of the Algorithm Using EgS, GADPH, ATE1, MEK2, 
MEK1, QB, LaminA/C ? c-myc, human cyclophilin, and mouse cyclophilin. 
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A number of genes have been identified as playing potentially important roles 
in disease etiology. Expression profiles of normal and diseased kidneys has 
implicated Edg5 in immunoglobulin A neuropathy, a common renal glomerular 
disease. Mycl, MEK1/2 and other related kinases have been associated with one or 
5 more cancers, while lamins have been implicated in muscular dystrophy and other 
diseases. For these reasons, siRNA directed against the genes encoding these classes 
of molecules would be important research and therapeutic tools. 

Figure 12 illustrates four siRJSfAs targeting 10 different genes (Table V for 
1 0 sequence and accession number information) that were selected according to the 
Formula VIII and assayed as individuals and pools in HEK293 cells. The level of 
siRNA induced silencing was measured using the B-DNA assay. These studies 
demonstrated that thirty-six out of the forty individual SMART-selected siRNA tested 
are functional (90%) and all 10 pools are fully functional. 

Example V. Validation of the Algorithm Using Bcl2 

Bcl-2 is a ~25kD, 205-239 amino acid, anti-apoptotic protein that contains 
considerable homology with other members of the BCL family including BCLX, 
MCL1, BAX, BAD, and BIK. The protein exists in at least two forms (Bcl2a, which 

20 has a hydrophobic tail for membrane anchorage, and Bcl2b, which lacks the 

hydrophobic tail) and is predominantly localized to the mitochondrial membrane. 
While Bcl2 expression is widely distributed, particular interest has focused on the 
expression of this molecule in B^and T cells. Bcl2 expression is down-regulated in 
normal germinal center B cells yet in a high percentage of follicular lymphomas, Bcl2 

25 expression has been observed to be elevated. Cytological studies have identified a 

common translocation ((14;18)(q32;q32)) amongst ahigh percentage (>70%) of these 
lymphomas. This genetic lesion places the Bcl2 gene in juxtaposition to 
immunoglobulin heavy chain gene (IgH) encoding sequences and is believed to 
enforce inappropriate levels of gene expression, and resistance to programmed cell 

30 death in the follicle center B cells. In other cases, hypomethylation of the Bcl2 

promoter leads to enhanced expression and again, inhibition of apoptosis. In addition 
to cancer, dysregulated expression of Bcl-2 has been correlated with multiple sclerosis 
and various neurological diseases. 
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The correlation between Bcl-2 translocation and cancer makes this gene an 
attractive target for RNAi. Identification of siRNA directed against the bcl2 transcript 
(or Bcl2-IgH fusions) would further our understanding Bcl2 gene function and 
possibly provide a future therapeutic agent to battle diseases that result from altered 
5 expression or function of this gene. 



In Silico Identification of Functional siRNA 

To identify functional and hyperfunctional siRNA against the Bcl2 gene, the 
sequence for Bcl-2 was downloaded from the NCBI Unigene database and analyzed 
10 using the Formula VIII algorithm. As a result of these procedures, both the sequence 
and SMARTscores™ of the Bcl2 siRNA were obtained and ranked according to their 
functionality. Subsequently, these sequences were BLAST' ed (database) to insure that 
the selected sequences were specific and contained minimal overlap with unrealated 
genes. The SMARTscores™ for the top 10 Bcl-2 siRNA are identified in Figure 13. 

15 

In Vivo Testing of Bcl-2 SiRNA 

Bcl-2 siRNAs having the top ten SMARTscores™ were selected and tested in 
a functional assay to determine silencing efficiency. To accomplish this, each of the 
ten duplexes were synthesized using 2 5 -0-ACE chemistry and transfected at lOOnM 
20 concentrations into cells. Twenty- four hours later assays were performed on cell 
extracts to assess the degree of target silencing. Controls used in these experiments 
included mock transfected cells, and cells that were transfected with a non-specific 
siRNA duplex. ^ , ^ , 

25 The results of these experiments are presented below (and in Figure 14) and 

show that all ten of the selected siRNA induce 80% or better silencing of the Bcl2 
message at lOOnM concentrations. These data verify that the algorithm successfully 
identified functional Bcl2 siRNA and provide a set of functional agents that can be 
used in experimental and therapeutic environments. 
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siRNA 1 
siRNA 2 
siRNA 3 
siRNA 4 
siRNA 5 
siRNA 6 
siRNA 7 
siRNA 8 
siRNA 9 
siRNA 10 



GGGAGAUAGUGAUGAAGUA 

GAAGUACAUCCAUUAUAAG 

GUACGACAACCGGGAGAUA 

AGAUAGUGAUGAAGUACAU 

UGAAGACUCUGCUCAGUUU 

GCAUGCGGCCUCUGUUUGA 

UGCGGCCUCUGUUUGAUUU 

GAGAUAGUGAUGAAGUACA 

GGAGAUAGUGAUGAAGUAC 

GAAGACUCUGCUCAGUUUG 



SEQ.ID 
SEQ. ID 
SEQ.ID 
SEQ.ID 
SEQ.ID 
SEQ.ID 
SEQ.ID 
SEQ.ID 
SEQ.ID 
SEQ.ID 



NO. 301 
NO. 302 
NO. 303 
NO. 304 
NO. 305 
NO. 306 
NO. 307 
NO. 308 
NO. 309 
NO. 310 



10 



Bcl2 siRNA: Sense Strand, 5'->3' 



15 



Example VI. Sequences Selected by the Algorithm 

Sequences of the siRNAs selected using Formulas (Algorithms) VIII and EX 
with their corresponding ranking, which have been evaluated for the silencing activity 
in vivo in the present study (Formula VIII and DC, respectively). 



TABLE V 



Gene 


Accession 






Formula 


Formula 


Name 


Number 


SEQ. ID NO. 


FTllSeqTence 


vin 


IX 


CLTC 


NM_004859 


SEQ. ID NO. 0301 


GAAAGAATCTGTAGAGAAA 


76 


94.2 


CLTC 


NM_004859 


SEQ. ID NO. 0302 


GCAATGAGCTGTTTGAAGA 


65 


39.9 


CLTC 


NMJ)04859 


SEQ. ID NO. 0303 


TGACAAAGGTGGATAAATT 


57 


38.2 


CLTC 


NM_004859 


SEQ. ID NO. 0304 


GGAAATGGATCTCTTTGAA 


54 


49.4 


CLTA 


NM_001833 


SEQ. LD NO. 0305 


GGAAAGTAATGGTCCAACA 


22 


55.5 


CLTA 


NMJXH833 


SEQ. ID NO. 0306 


AGACAGTTATGCAGCTATT 


4 


22.9 


CLTA 


NM_001833 


SEQ. ID NO. 0307 


CC AATTCTCGG AAG C AAGA 


1 


17 


CLTA 


NM_001833 


SEQ. ID NO. 0308 


GAAAGTAATGGTCCAACAG 


-1 


-13 


CLTB 


NM_001834 


SEQ. ID NO. 0309 


GCGCCAGAGTGAACAAGTA 


17 


57.5 


CLTB 


NM_001834 


SEQ. ID NO. 0310 


GAAGGTGGCCCAGCTATGT 


15 


-8.6 


CLTB 


NM_001834 


SEQ. ID NO. 0311 


GGAACCAGCGCCAGAGTGA 


13 


40.5 


CLTB 


NM_001834 


SEQ. ID NO. 0312 


GAGCGAGATTGCAGGCATA 


20 


61.7 


CALM 


U45976 


SEQ. ID NO. 0313 


GTTAGTATCTGATGACTTG 


36 


-34.6 


CALM 


U45976 


SEQ. ID NO. 0314 


GAAATGGAACCACTAAGAA 


33 


46.1 


CALM 


U45976 


SEQ. ID NO. 0315 


GGAAATGGAACCACTAAGA 


30 


6L2 


CALM 


U45976 


SEQ. ID NO. 0316 


CAACTACACTTTCCAATGC 


28 


6.8 


EPS 15 


NMJXH9S1 


SEQ. ID NO. 0317 


CCACCAAGAT1TCATGATA 


48 


25.2 



WO 2004/045543 PCT/US2003/036787 

71 



EPS 15 


NM_001981 


SEQ. ED NO. 0318 


GATCGGAACTCCAACAAGA 


43 


49.3 


EPS 15 


NM_001981 


SEQ. ED NO. 0319 


AAACGGAGCTACAGATTAT 


39 


11.5 


T~>T"\ O 1 /" 

EPS 15 


NM_001981 


SEQ. ID NO. 0320 


CCACACAGCATTCTTGTAA 


33 


-23.6 


EPS15R 


NM_021235 


SEQ. ED NO. 0321 


GAAGTTACCTTGAGCAATC 


48 


33 


EPS15R 


NM_021235 


SEQ. ID NO. 0322 


GGACTTGGCCGATCCAGAA 


27 


33 


EPS15R 


NM_021235 


SEQ. ID NO. 0323 


GCACTTGGATCGAGATGAG 


20 


1.3 


EPS15R 


NM_021235 


SEQ. ED NO. 0324 


CAAAGACCAATTCGCGTTA 


17 


27.7 


DNM2 


NM_004945 


SEQ. ID NO. 0325 


CCGAATCAATCGCATCTTC 


6 


-29.6 


DNM2 


NM_004945 


SEQ. ID NO. 0326 


GACATGATCCTGCAGTTCA 


5 


-14 


DNM2 


NM_004945 


SEQ. ID NO. 0327 


GAGCGAATCGTCACCACTT 


5 


24 


DNM2 


NM_004945 


SEQ. ID NO. 0328 


CCTCCGAGCTGGCGTCTAC 


-4 


-63.6 


ARF6 


AF93885 


SEQ. ED NO. 0329 


TCACATGGTTAACCTCTAA 


27 


-21.1 


ARF6 


AF93885 


SEQ. ED NO. 0330 


GATGAGGGACGCCATAATC 


7 


-38.4 


ARF6 


AF93885 


SEQ.. IP NO. 0331 


CCTCTAACTACAAATCTTA 


4 


16.9 


ARP6 


AF93885 


SEQ. ID NO. 0332 


GGAAGGTGCTATCCAAAAT 


4 


11.5 


RAB5A 


BC001267 


SEQ. ID NO. 0333 


GCAAGCAAGTCCTAACATT 


40 


25.1 


RAB5A 


BC001267 


SEQ. ID NO. 0334 


GGAAGAGGAGTAGACCTTA 


17 


50.1 


RAB5A 


BC001267 


SEQ. ED NO. 0335 


AGGAATCAGTGTTGTAGTA 


16 


11.5 


RAB5A 


BC001267 


SEQ. ID NO. 0336 


GAAGAGGAGTAGACCTTAC 


12 


7 


RAB5B 


NMJ)02868 


SEQ. ID NO. 0337 


GAAAGTCAAGCCTGGTATT 


14 


18.1 


RAB5B . 


NM_002868 


SEQ. ID NO. 0338 


AAAGTCAAGCCTGGTATTA 


6 


-17.8 


RAB5B 


NM_002868 


SEQ. ID NO. 0339 


GCTATGAACGTGAATGATC 


3 


,21.1 


RAB5B 


NM_002868 


SEQ. ID NO. 0340 


CAAGCCTGGTATTACGTTT 


-7 


-37.5' 


RAB5C 


AF141304 


SEQ. ED NO. 0341 


G G AAC AAG ATCTGTC AATT 


38 


51.9* 


RAB5C 


AF141304 


SEQ. ID NO. 0342 


GCAATGAACGTGAACGAAA 


29 


43.7 


RAB5C 


AF141304 


SEQ. ID NO. 0343 


CAATGAACGTGAACGAAAT 


18 


43.3 


RAB5C 


AF141304 


SEQ. ED NO. 0344 


GGACAGGAGCGGTATCACA 


6 


18.2 


EEA1 


XMJH8197 


SEQ. ID NO. 0345 


" "aCjACAGAGCTTGAGAATAA 


67 


64.1 


EEA1 


XM_018197 


SEQ. ID NO. 0346 


GAGAAGATCTTTATGCAAA 


60 


48.7 


EEA1 


XM_018197 


SEQ. ID NO, 0347 


GAAGAGAAATCAGCAGATA 


58 


45.7 


EEA1 


XM_018197 


SEQ. ID NO. 0348 


G C AAGTAACTC AACTAAC A 


56 


72.3 


AP2B1 


NM_001282 


SEQ. ID NO. 0349 


GAGCTAATCTGCCACATTG 


49 


-12.4 


AP2B1 


NM_001282 


SEQ. ID NO. 0350 


GCAGATGAGTTACTAGAAA 


44 


48.9 


AP2B1 


NM_001282 


SEQ. ID NO. 0351 


CAACTTAATTGTCCAGAAA 


41 


28.2 


AP2B1 


NM_001282 


SEQ. ID NO. 0352 


CAACACAGGATTCTGATAA 


33 


-5.8 


PLK 


NM_005030 


SEQ. ED NO. 0353 


AGATTGTGCCTAAGTCTCT 


-35 


-3.4 


PLK 


NM_005030 


SEQ. ID NO. 0354 


ATGAAGATCTGGAGGTGAA 


0 


-4.3 


PLK 


NM_005030 


SEQ. ID NO. 0355 


TTTGAGACTTCTTGCCTAA 


-5 


-27.7 


PLK 


NM_005030 


SEQ. ED NO. 0356 


AGATCACCCTCCTTAAATA 


15 


72.3 


GAPDH 


NM_002046 


SEQ. ID NO. 0357 


CAACGGATTTGGTCGTATT 


27 


-2.8 
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OAPDH 


NM_002046 


SEQ. ID NO. 0358 


/** A T1TMT 

GAPDH 


NM_002046 


SEQ. ID NO. 0359 


GAPDH 


NM_002046 


SEQ. ID NO. 0360 


c-Myc 




SEQ. ID NO. 0361 


c-Myc 




SEQ. ID NO. 0362 


c-Myc 




SEQ. ID NO. 0363 


c-Myc 




SEQ. ID NO. 0364 


MAP2K1 


NM_002755 


SEQ. ID NO. 0365 


MAP2K1 


NM_002755 


SEQ. ID NO. 0366 


MAP2K1 


NM_002755 


SEQ. ED NO. 0367 


MAP2K1 


NMJ)02755 


SEQ. ID NO. 0368 


MAP2K2 


NMJ)30662 


SEQ. ID NO. 0369 


MAP2K2 


NM_030662 


SEQ. ID NO. 0370 


MAP2K2 


NM_030662 


SEQ. ID NO. 0371 


MAP2K2 


NM_030662 


SEQ. ID NO. 0372 


KNSL1(EG5) 


NM_004523 


SEQ. ID NO. 0373 


KNSLI(EG5) 


NM_004523 


SEQ. ID NO. 0374 


KNSL1(EG5) 


NM_004523 


SEQ. ID NO. 0375 


KNSL1(EG5) 


NM_004523 


SEQ. ED NO. 0376 


CyclophilinA_ 


NM_021130 


SEQ. ID NO. 0377 


CyclophilinA_ 


NMJ)21130 


SEQ. ID NO. 0378 


CycIophilinA_ 


NM_021130 


SEQ. ID NO. 0379 


CyclophilinA_ 


NMJ)21130 


SEQ. ID NO. 0380 


LAMIN A/C 


NM_1 70707 


SEQ. ID NO. 0381 


LAMINA/C 


NM_1 70707 


SEQ. ID NO. 0382 


LAMIN A/C 


NM_1 70707 


SEQ. ED NO. 0383 


LAMIN A/C 


NM_1 70707 


SEQ. ED NO. 0384 


CyclophilinB 


M60857 


SEQ. ID NO. 0385 


CyclophilinB 


M60857 


SEQ. ID NO. 0386 


CyclophilinB 


M60857 


SEQ. ID NO. 0387 


CyclophilinB 


M60857 


SEQ. ID NO. 0388 


DBIl 


NMJ)20548 


SEQ. ED NO. 0389 


DBI2 


NM_020548 


SEQ. ID NO. 0390 


DBI3 


NM_020548 


SEQ. ED NO. 0391 


DBI4 


NM_020548 


SEQ. ED NO. 0392 


rLUCl 




SEQ. ED NO. 0393 


rLUC2 




SEQ. ID NO. 0394 


rLUC3 




SEQ. ED NO. 0395 


rLUC4 




SEQ. ID NO. 0396 


SeAPl 


NM_031313 


SEQ. ID NO. 0397 
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GAAATCCCATCACCATCTT 


24 


• 3.9 


GACCTCAACTACATGGTTT 


22 


-22.9 


TGGTTTACATGTTCCAATA 


9 


9.8 


GAAGAAATCGATGTTGTTT 


31 


-11.7 


ACACAAACTTGAACAGCTA 


22 


51.3 


GGAAGAAATCGATGTTGTT 


18 


26 


GAAACGACGAGAACAGTTG 


18 


-8.9 


GCACATGGATGGAGGTTCT 


26 


16 


GCAGAGAGAGCAGATTTGA 


16 


0.4 


GAGGTTCTCTGGATCAAGT 


14 


15.5 


GAGCAGATTTGAAGCAACT 


14 


18.5 


CAAAGACGATGACTTCGAA 


37 


26.4 


G ATC AGC ATTTGC ATG GAA 


24 


-0.7 


TCCAGGAGTTTGTCAATAA 


17 


-4.5 


GGAAGCTGATCCACCTTGA 


16 


59.2 


GCAGAAATCTAAGGATATA 


53 


35.8 


CAACAAGGATGAAGTCTAT 


50 


18.3 


CAGCAGAAATCTAAGGATA 


41 


32.7 


CTAGATGGCTTTCTCAGTA 


39 


3.9 


AGACAAGGTCCCAAAGACA 


-16 


58.1 


GGAATGGCAAGACCAGCAA 


-6 


36 


AGAATTATTCCAGGGTTTA 


-3 


16.1 


GCAGACAAGGTCCCAAAGA 


8 


8.9 


AGAAGCAGCTTCAGGATGA 


31 


38.8 


GAGCTTGACTTCCAGAAGA 


33 


22.4 


CCACCGAAGTTCACCCTAA 


21 


27.5 


GAGAAGAGCTCCTCCATCA 


55 


30.1 


gaAagagcatctacggtga 


41 


83.9 


GAAAGGAITI'GGCTACAAA 


53 


59.1 


ACAGCAAATTCCATCGTGT 


-20 


28.8 


GG AAAGACTGTTCCAAAAA 


2 


27 


CAACACGCCTCATCCTCTA 


27 


-7.6 


CATGAAAGCTTACATCAAC 


25 


-30.8 


AAGATGCCATGAAAGCTTA 


17 


22 


GCACATACCGCCTGAGTCT 


15 


3.9 


GATCAAATCTGAAGAAGGA 


57 


49.2 


GCCAAGAAGTTTCCTAATA 


50 


13.7 


CAGCATATCTTGAACCATT 


41 


-2.2 


GAACAAAGGAAACGGATGA 


39 


29.2 


CGGAAACGGTCCAGGCTAT 


6 


26.9 
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NM 031313 


SEO ID NO. 0398 


G CTTCG AGC AG AC ATG AT A 


4 


-11.2 




\ru 031313 


SEO ID NO 0399 


CCTACACGGTCCTCCTATA 


4 


4.9 




MM 031313 


SEO ID NO. 0400 


GCCAAGAACCTCATCATCT 


1 


-9.9 


fT TTP1 




SEQ. ID NO. 0401 


GATATGGGCTGAATACAAA 


54 


40.4 






SEO ID NO. 0402 


GCACTCTGATTGACAAATA 


47 


54.7 






SEO ID NO. 0403 


TGAAGTCTCTGATTAAGTA 


46 


34.5 


fT TTP/l 




SEO ID NO. 0404 


TCAGAGAGATCCTCATAAA 


40 


11.4 


LDV^yCIO 1 




SEO ID NO. 0405 


GCAAGAAGATCACCATrrC 


52 


46.4 


mwycio__z 


"MM 00RQ07 

rNlvl UU07U / 


SEO ED NO 0406 


GAGAGAAATTTGAGGATGA 


36 


70.7 


mi^ycio_j 


MM 00RQ07 


SEO ID NO 0407 


GAAAGGATTTGGCTATAAG 


35 


-1.5 


niLyycio_j+ 


vnvyf 008Q07 


SEO ID NO 0408 


GAAAGAAGGCATGAACATT 


27 


10.3 


Q /~>T 0 1 




SEO ID NO 0409 


GGGAGATAGTGATGAAGTA 


21 


72 




J.N1V1 V/UV/UJj 


SFO ID NO 0410 


GAAGTACATCCATTATAAG 


1 


3.3 




MN/f 000633 


SEO ID NO 0411 


GTACGACAACCGGGAGATA 


1 


35.9 


r>LJL,Z_4 


MTuf 000633 


SFO ID NO 0412 


AGATAGTGATGAAGTACAT 


-12 


22.1 




XTM* 0006^3 
iN 1V1_\J UUOjj 


WO TDNO 0413 


TGAAGACTCTGCTCAGTTT 


36 


19.1 


BCL2_o 


lNiVl UUUOJ J 


^FO TDNO 0414 


GCATGCGGCCTCTGTTTGA 


5 


-9.7 


QB1 


TviK/r nnii/^ i 
JNM_UU JOOD. 1 




GCACACAGCUUACUACAUC 


52 


-4.8 


QB2 


iN M_W j j 0 J . 1 




GAAAUGCCCUGGUAUCUCA 


49 


22.1 


Q133 


-\TKyf 1 
JNJVL_UUj 30 D, 1 


qpf) TDNO 0417 


GAAGGAACGUGAUGUGAUC 


34 


22.9 


QB4 


1N1VI UUjjOj.1 


^IFO TT> NO 04 1 8 


GCACUACUCCUGUGUGUGA 


28 


20.4 


a nrc i 1 
Albl-1 


"Miv/f nrnfiA 1 

JN1YI Uu/Utl 


SFO ID NO 0419 


GAACCCAGCUGGAGAACUU 


45 


15.5 


AlJal-z 




SFO TDNO 0420 


GAUAUACAGUGUGAUCUUA 


40 


12.2 


A TC 1 1 




SFO TDNO 0421 


GUACUACGAUCCUGAUUAU 


37 


32.9 


ATE1-4 


"XTK 4 f\f\Hf\A "1 

NM_007041 




Gl JGrCGACCUUUACAAUUU 


35 


18.2 


EGFR-1 


NM_005228 


SEQ. ID NO. 0423 


GAAGGAAACTGAATTCAAA 


68 


79.4 


EGFR-1 


NM_005228 


SEQ. ID NO. 0424 


GGAAATATGTACTACGAAA 


49 


49.5 


EGFR-1 


NM_005228 


SEQ. ID NO. 0425 


CCACAAAGCAGTGAATri'A 


41 


7.6 


EGFR-1 


NNT 005228 


SEQ. ID NO. 0426 


GTAACAAGCTCACGCAGTT 


40 


25.9 



Example VII. Genome-Wide Application of the Algorithm 

The examples described above demonstrate that the algorithm(s) can 
5 successfully identify functional silUNA and that these duplexes can be used to induce 
the desirable phenotype of transcriptional knockdown or knockout. Each gene or 
family of genes in each organism plays an important role in maintaining physiological 
homeostasis and the algorithm can be used to develop functional, highly functional, or 
hyperfunctional siRNA to each gene. To accomplish this for the human genome, the 
1 0 entire online ncbi refseq database was accessed through Entrez (efetch). The database 
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was processed through Formula VIII. For each gene the top 80 -1 00 scores for 
siRNAs were obtained and BLAST 5 ed to insure that the selected sequences are 
specific in targeting the gene of choice. These sequences are provided on the 
enclosed CDs in electronic form. Accordingly, Applicants hereby incorporate by 
5 reference the material submitted herewith, in duplicate on the compact disks labeled 
COPY 1 - TABLES PART, DISK 1/1, TABLES 12-15, Filed with RO/US under 
PCT AI sec. 801(a), Operating System: MS-Windows, COPY 2 - TABLES PART, 
DISK 1/1, TABLES 12 -15, Filed with RO/US under PCT AI sec. 801(a), Operating 
System: MS-Windows, COPY 3 - TABLES PART, DISK 1/1, TABLES 12 -15, 

1 0 Filed with RO/US under PCT AI sec. 801 (a), Operating System: MS-Windows,; 
which copies are identical, in files entitled Table_12.txt, date of creation June 26, 
2003, with a size of 31,045 kb; Table_13.txt, date of creation November 13, 2003, 
with a size of 78,451 kb; TableJ4.txt, date of creation November 13, 2003, with a 
size of 454 kb; and Table_15.txt date of creation November 13, 2003, with a size of 

15 1,690 kb. 

With respect to the disks, there are four tables on each disk copy in text 
format: Tables XII -XV. Table XII, which is located in a file entitled Table_12.txt, 
provides a list of the 80-100 sequences for each target, identified by Formula VIII as 

20 having the highest relative SMARTscores™ for the target analyzed. Table XIII, 

which is located in a file entitled Table_13.txt, provides the SMARTscores™, and for 
each gene, a pool pick of up to four sequences is denoted. (The denotation of "1" in 
Table XIII means that it is a pool pick.) These pool pick sequences represent the most 
functional siRNAs for the corresponding target. Any 1, 2, 3, or 4 of the pool pick 

25 sequences could be used for gene silencing. Further, sequences that are not denoted 
as pool pick sequences, but that are included on the compact disks may also be used 
for gene silencing either alone or in combination with other sequences. However, 
their individual relative functionality would be less than that of a pool pick sequence. 
Table XIV, which is located in a file entitled Table_14.txt, provides an identification 

30 of genes by accession number, and Table XV, which is located in a file entitled 
Table__15.txt, provides a short name for the genes identified on the disk. The 
information contained on the disks is part of this patent application and are 
incorporated into the specification by reference. One may use these tables in order to 
identify functional siRNAs for the gene provided therein, by simply looking for the 
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gene of interest and an siRNA that is listed as functional. Preferably, one would 
select one or more of the siRNA that most optimized for the target of interest and is 
denoted as a pool pick. 



5 Table XII: siRNA selected by Formula VIII 

See data submitted herewith on a CD-ROM in accordance with PCT 
Administrative Instructions Section 801(a) 

Table XHI: SMARTscores™ 
10 See data submitted herewith on a CD-ROM in accordance with PCT 

Administrative Instructions Section 801(a) 



Table XTV: Identification of Targets 

See data submitted herewith on a CD-ROM in accordance with PCT 
15 Administrative Instructions Section 801(a) 



20 



Table XV: Description of Targts 

See data submitted herewith on a CD-ROM in accordance with PCT 
Administrative Instructions Section 801(a) 



Many of the genes to which the described siRNA are directed play critical 
roles in disease etiology. For this reason, the siRNA listed in the accompanying 
compact disk may potentially act as therapeutic agents.. A number of prophetic 
examples follow and should be understood in view of the siRNA that are identified on 
25 the accompanying CD. To isolate these siRNA, the appropriate message sequence for 
each gene is analyzed using one of the before mentioned formulas (preferably formula 
VIII) to identify potential siRNA targets. Subsequently these targets are BLAST' ed to 
eliminate homology with potentially off-targets. 

30 The list of potential disease targets is extensive. For instance, over-expression 

of Bel 10 has been implicated in the development of MALT lymphoma (mucosa 
associated lymphoid tissue lymphoma) and thus, functional, highly functional, or 
hyperfunctional siRNA directed against that gene (e.g. SEQ. ID NO. 0427: 
GGAAACCUCUCAUUGCUAA; SEQ. ID NO. 0428: 
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GAAAGAACCUUGCCGAUCA; SEQ. ID NO. 0429: 
GGAAAUACAUCAGAGCUUA, or SEQ. ID NO. 0430: 
GAAAGUAUGUGUCUUAAGU) may contribute to treatment of this disorder. 

5 In another example, studies have shown that molecules that inhibit 

glutamine:fructose-6-phosphate aminotransferase (GFA) may act to limit the 
symptoms suffered by Type II diabetics. Thus, functional, highly functional, or 
hyperfunctional siRNA directed against GFA (also known as GFPT1: siRNA = SEQ. 
ID NO. 0433 UGAAACGGCUGCCUGAUUU; SEQ. ID NO. 0434 
1 0 GAAGUU ACCUCUUAC AUUU; SEQ. ID NO. 043 5 
GUACGAAACUGUAUGAUUA; SEQ. ID NO. 0436 

GGACGAGGCUAUCAUUAUG) may contribute to treatment of this disorder. 

In another example, the von Hippel-Lindau (VHL) tumor suppressor has been 

1 5 observed to be inactivated at a high frequency in sporadic clear cell renal cell 
carcinoma (RCC) and RCCs associated with VHL disease. The VHL tumor 
suppressor targets hypoxia-inducible factor- 1 alpha (HIF-1 alpha), a transcription 
factor that can induce vascular endothelial growth factor (VEGF) expression, for 
ubiquitination and degradation. Inactivation of VHL can lead to increased levels of 

20 HIF-1 alpha, and subsequent VEGF over expression. Such over expression of VEGF 
has been used to explain the increiased (and possibly necessary) vascularity observed 
in RCC. Thus, functional, highly functional, or hyperfunctional siRNAs directed 
against either HIF-1 alpha (SEQ. ID NO. 0437 GAAGGAACCpSAUGCUUUA; 
SEQ. ID NO. 0438 GCAUAUAUCUAGAAGGUAU; SEQ. ID NO. 0439 

25 GAACAAAUACAUGGGAUUA; SEQ. ID NO. 0440 

GGACACAGAUUUAGACUUG) or VEGF (SEQ. ID NO. 0441 
GAACGUACUUGCAGAUGUG; SEQ. ID NO. 0442 
GAGAAAGCAUUUGUUUGUA; SEQ. ID NO. 0443 
GGAGAAAGCAUUUGUUUGU; SEQ. ID NO. 0444 

30 CGAGGCAGCUUGAGUUAAA) may be useful in the treatment of renal cell 
carcinoma. 

In another example, gene expression of platelet derived growth factor A and B 
(PDGF-A and PDGF-B) has been observed to be increased 22- and 6-fold, 
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respectively, in renal tissues taken from patients with diabetic nephropathy as 

compared with controls. These findings suggest that over expression ofPDGF A and 

B may play a role in the development of the progressive fibrosis that characterizes 

human diabetic kidney disease. Thus, functional, highly functional, or hyperfunctional 

siRNAs directed against either PDGF A 

(SEQ. ID NO. 0445: GGUAAGAUAUUGUGCUUUA; 

SEQ. ID NO. 0446: CCGCAAAUAUGCAGAAUUA; 

SEQ. ID NO. 0447: GGAUGUACAUGGCGUGUUA; 

SEQ. ID NO. 0448: GGUGAAGUUUGUAUGUUUA) or 



10 



PDGFB 

(SEQ. ID NO. 0449: CCGAGGAGCUUUAUGAGAU; 
SEQ. ID NO. 0450: GCUCCGCGCUUUCCGAUUU; 
SEQ. ID NO. 0451 GAGCAGGAAUGGUGAGAUG; 
15 SEQ. ID NO. 0452: GAACUUGGGAUAAGAGUGU; 
SEQ. ID NO. 0453 CCGAGGAGCUUUAUGAGAU; 

SEQ. ID NO. 0454 UUUAUGAGAUGCUGAGUGA) may be useful in the treatment 
of this form of kidney disorder. 

20 In another example, a strong correlation exists between the over-expression of 

glucose transporters (e.g. GLUT12) and cancer cells. It is predicted that cells 
undergoing uncontrolled cell growth up-regulate GLUT molecules so that they can 
cope with the heightened energy needs associated with increasedrate&of proliferation 
and metastasis. Thus, siRNA-based therapies that target the molecules such as 

25 GLUT1 (also known as SLC2A1 : siRNA= 

SEQ. ID NO.: 0455 GCAAUGAUGUCCAGAAGAA; 
SEQ. ID NO.: 0456 GAAGAAUAUUCAGGACUUA: 
SEQ. ID NO.: 0457 GAAGAGAGUCGGCAGAUGA; 
SEQ. ID NO.: 0458 CCAAGAGUGUGCUAAAGAA) 



30 



GLUT12 (also known as SLCA12: siRNA = 
SEQ. ID NO. 0459: GAGACACUCUGAAAUGAUA; 
SEQ. ID NO. 0460: GAAAUGAUGUGGAUAAGAG; 
SEQ. ID NO. 0461 : GAUCAAAUCCUCCCUGAAA; 
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SEQ. ID NO. 0462: UGAAUGAGCUGAUGAUUGU) and other related transporters, 
may be of value in treating a multitude of malignancies. 

The siRNA sequences listed above are presented in a 5'-> 3' sense strand 
direction. In addition, siRNA directed against the targets listed above as well as those 
directed against other targets and listed in the accompanying compact disk may be 
useful as therapeutic agents. 

Example Vm. Evidence for the Benefits of Pooling 

Evidence for the benefits of pooling have been demonstrated using the 
reporter gene, luciferase. Ninety siRNA duplexes were synthesized using Dharmacon 
proprietary ACE® chemistry against one of the standard reporter genes: firefly 
luciferase. The duplexes were designed to start two base pairs apart and to cover 
approximately 180 base pairs of the luciferase gene (see sequences in Table III). 
Subsequently, the siRNA duplexes were co-transfected with a luciferase expression 
reporter plasmid into HEK293 cells using standard transfection protocols and 
luciferase activity was assayed at 24 and 48 hours. 

Transfection of individual siRNAs showed standard distribution of inhibitory 
effect. Some duplexes wereractive, while others were not. Figure 15 represents a 
typical screen of ninety siRNA duplexes (SEQ. ID NO. 0032- 0120) positioned two 
base pairs apart. As the figure suggests, the functionality of the siRNA duplex is 
determined more by a particular sequence of the oligonucleotide than b.yJhe.relative 
oligonucleotide position within a gene or excessively sensitive part of the mRNA, 
which is important for traditional anti-sense technology. 

When two continuous oligonucleotides were pooled together, a significant 
increase in gene silencing activity was observed. (See Figure 16) A gradual increase 
in efficacy and the frequency of pools functionality was observed when the number of 
siRNAs increased to 3 and 4. (Figures 16, 17). Further, the relative positioning of 
the oligonucleotides within a pool did not determine whether a particular pool was 
functional (see Figure 18, in which 100% of pools of oligonucleotides distanced by 2, 
10 and 20 base pairs were functional). 
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However, relative positioning may nonetheless have an impact. An increased 
functionality may exist when the siRNA are positioned continuously head to toe (5' 
end of one directly adjacent to the 3 5 end of the others). 



5 Additionally, siRNA pools that were tested performed at least as well as the 

best oligonucleotide in the pool, under the experimental conditions whose results are 
depicted in Figure 19. Moreover, when previously identified non-functional and 
marginally (semi) functional siRNA duplexes were pooled together in groups of five 
at a time, a significant functional cooperative action was observed. (See Figure 20) 
10 In fact, pools of semi-active oligonucleotides were 5 to 25 times more functional than 
the most potent oligonucleotide in the pool. Therefore, pooling several siRNA 
duplexes together does not interfere with the functionality of the most potent siRNAs 
within a pool, and pooling provides an unexpected significant increase in overall 
functionality 

15 

Example IX. Pooling Across Species 

Experiments were performed on the following genes: P-galactosidase, Renilla 
luciferase, and Secreted alkaline phosphatase, which demonstrates the benefits of 
pooling, (see Figure 21) Approximately 50% of individual siRNAs designed to 
20 silence the above-specified geneSF were functional, while 1 00% of the pools that 
contain the same siRNA duplexes were functional. 

Example X. Highly Functional siRNA 

Pools of five siRNAs in which each two siRNAs overlap to 10-90% resulted 
25 in 98% functional entities (>80% silencing). Pools of siRNAs distributed throughout 
the mRNA that were evenly spaced, covering an approximate 20 - 2000 base pair 
range, were also functional. When the pools of siRNA were positioned continuously 
head to tail relative to mRNA sequences and mimicked the natural products of Dicer 
cleaved long double stranded RNA, 98% of the pools evidenced highly functional 
30 activity (>95% silencing). 



Example XI. Human cyclophyline 



WO 2004/045543 PCI7US2003/036787 

80 

Table III above lists the siRNA sequences for the human cyclophyline 
protein. A particularly functional siRNA may be selected by applying these 
sequences to any of Formula I to VII above. 



5 Alternatively, one could pool 2, 3, 4, 5 or more of these sequences to create a 

kit for silencing a gene. Preferably, within the kit there would be at least one 
sequence that has a relatively high predicted functionality when any of Formulas I - 
VII is applied. 

10 Example XII. Sample Pools of siRNAs and Their Application to Human Disease 

The genetic basis behind human disease is well documented and siRNA may 
be used as both research or diagnostic tools and therapeutic agentsr either individually 
or in pools. Genes involved in signal transduction, the immune response, apoptosis, 
DNA repair, cell cycle control, and a variety of other physiological functions have 
15 clinical relevance and therapeutic agents that can modulate expression of these genes 
may alleviate some or all of the associated symptoms. In some instances, these genes 
can be described as a member of a family or class of genes and siRNA (randomly, 
conventionally, or rationally designed) can be directed against one or multiple 
members of the family to induce a desired result. 

20 

To identify rationally designed siRNA to each gene, the sequence was 
analyzed using Formula VIII to identify a SMARTpool containing the functional 
sequences. To confirm the activity of these sequences, the siRNA are introduced into 
a cell type of choice (e.g. HeLa cells, HEK293 cells) and the levels of the appropriate 

25 message are analyzed using one of several art proven techniques. SiRNA having 

heightened levels of potency can be identified by testing each of the before mentioned 
duplexes at increasingly limiting concentrations. Similarly, siRNA having increased 
levels of longevity can be identified by introducing each duplex into cells and testing 
functionality at 24, 48, 72, 96, 120, 144, 168, and 192 hours after transfection. Agents 

30 that induce >95% silencing at sub-nanomolar concentrations and/or induce functional 
levels of silencing for >96 hours are considered hyperfunctional. 

The following are non-limiting examples of families of proteins to which 
siRNA described in this document are targeted against: 
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Transporters, Pumps, and Channels 

5 Transporters, pumps, and channels represent one class of genes that are 

attractive targets for siRNAs. One major class of transporter molecules are the ATP- 
binding cassette (ABC) transporters. To date, nearly 50 human ABC-transporter 
genes have been characterized and have been shown to be involved in a variety of 
physiological functions including transport of bile salts, nucleosides, chloride ions, 

10 cholesterol, toxins, and more. Predominant among this group are MDR1 (which 
encodes the P-glycoprotein, NP_000918), the MDR-related proteins (MRP 1-7), and 
the breast cancer resistance protein (BCRP). In general, these transporters share a 
common structure, with each protein containing a pair of ATP-binding domains (also 
known as nucleotide binding folds, NBF) and two sets of transmembrane (TM) 

1 5 domains, each of which typically contains six membrane-spanning a-helices. The 
genes encoding this class of transporter are organized as either full transporters (i.e. 
containing two TM and two NBF domains) or as half transporters that assemble as 
either homodimers or heterodimers to create functional transporters. As a whole, 
members of the family are widely dispersed throughout the genome and show a high 

20 degree of amino acid sequence identify among eukaryotes. 

ABC-transporters have been implicated in several human diseases. For 
instance, molecular efflux pumps of this type play a major role in the development of 
drug resistance exhibited by a variety of cancers and pathogenic microorganisms. In 

25 the case of human cancers, increased expression of the MDR1 gene and related pumps 
have been observed to generate drug resistance to a broad collection of commonly 
used chemotherapeutics including doxorubicin, daunorubicin, vinblastine, vincristine, 
colchicines. In addition to the contribution these transporters make to the 
development of multi-drug resistance, there are currently 1 3 human genetic diseases 

30 associated with defects in 14 different transporters. The most common of these 

conditions include cystic fibrosis, Stargardt disease, age-related macular degeneration, 
adrenoleukodystrophy, Tangier disease, Dubin- Johnson syndrome and progressive 
familial intrahepatic cholestasis. For this reason, siRNAs directed against members of 
this, and related, families are potentially valuable research and therapeutic tools. 
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With respect to channels, analysis of Drosophila mutants has enabled the 
initial molecular isolation and characterization of several distinct channels including 
(but not limited to) potassium (K+) channels. This list includes shaker (Sh), which 
encodes a voltage activated K + channel, slowpoke (Slo), a Ca 2+ activated K + channel, 
5 and ether-a-go-go (Eag). The Eag family is further divided into three subfamilies: 
Eag, Elk (eag-like K channels), and Erg (Eag related genes). 

The Erg subfamily contains three separate family members (Ergl-3) that are 
distantly related to the sh family of voltage activated K + channels. Like sh, erg 

1 0 polypetides contain the classic six membrane spanning architecture of K + channels 
(S1-S6) but differ in that each includes a segment associated with the C-terminal 
cytoplasmic region that is homologous to cyclic nucleotide binding domains (cNBD). 
Like many isolated ion channel mutants, erg mutants are temperature-sensitive 
paralytics, a phenotype caused by spontaneous repetitive firing (hyperactivity) in 

1 5 neurons and enhanced transmitter release at the neuromuscular junction. 

Initial studies on the tissue distribution of all three members of the erg 
subfamily show two general patterns of expression. Ergl and erg3 are broadly 
expressed throughout the nervous system and are observed in the heart, the superior 

20 mesenteric ganglia, the celiac ganglia, the retina, and the brain. In contrast, erg2 
shows a much more restricted pattern of expression and is only observed in celiac 
ganglia and superior mesenteric ganglia. Similarly, the kinetic properties of the three 

. ; ■ erg potassium channels are not homogeneous. Ergl and erg2 channels are relatively 
slow activating delayed rectifiers whereas the erg3 current activates rapidly and then 

25 exhibits a predominantly transient component that decays to a sustained plateau. The 
current properties of all three channels are sensitive to methanesulfonanilides, 
suggesting a high degree of conservation in the pore structure of all three proteins. 

Recently, the erg family of K + channels has been implicated in human disease. 
30 Consistent with the observation that ergl is expressed in the heart, single strand 
conformation polymorphism and DNA sequence analyses have identified HERG 
(human ergl) mutations in six long-QT-syndrome (LQT) families, an inherited 
disorder that results in sudden death from a ventricular tachyarrythmia. Thus siRNA 
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directed against this group of molecules (e.g. KCNH1-8) will be of extreme 
therapeutic value. 



Another group of channels that are potential targets of siRNAs are 
5 the CLCA family that mediate a Ca 2+ -activated CP conductance in a variety of 
tissues. To date, two bovine (bCLCl; bCLCA2 (Lu-ECAM-1)) 5 three mouse 
(mCLCAl; mCLCA2; mCLCA3) and four human (hCLCAl; hCLCA2; hCLCA3; 
hCLCA4) CLCA family members have been isolated and patch-clamp studies with 
transfected human embryonic kidney (HEK-293) cells have shown that bCLCAl, 
10 mCLCAl, and hCLCAl mediate a Ca 2+ -activated CP conductance that can be 
inhibited by the anion channel blocker DIDS and the reducing agent dithiothreitol 
(DTT). 

The protein size, structure, and processing seem to be similar among different 
1 5 CLCA family members and has been studied in greatest detail for Lu-ECAM- 1 . The 
Lu-ECAM-1 open reading frame encodes a precursor glycoprotein of 130 kDa that is 
processed to a 90-kDa ammo-terminal cleavage product and a group of 30- to 40-kDa 
glycoproteins that are glycosylation variants of a single polypeptide derived from its 
carboxy terminus. Both subunits are associated with the outer cell surface, but only 
20 the 90-kDa subunit is thought to be anchored to.the cell membrane via four 
transmembrane domains. 

. Although the protein processing and function appear to be conserved among 
CLCA homology significant differences exist in their tissue expression patterns. For 

25 example, bovine Lu-ECAM-1 is expressed primarily in vascular endothelia, bCLCAl 
is exclusively detected in the trachea, and hCLCAl is selectively expressed in a 
subset of human intestinal epithelial cells. Thus the emerging picture is that of a 
multigene family with members that are highly tissue specific, similar to the C1C 
family of voltage-gated CP channels. The human channel, hCLCA2, is particular 

30 interesting from a medical and pharmacological standpoint. CLCA2 is expressed on 
the luminal surface of lung vascular endothelia and serves as an adhesion molecule 
for lung metastatic cancer cells, thus mediating vascular arrest and lung colonization. 
Expression of this molecule in normal mammary epithelium is consistently lost in 
human breast cancer and in nearly all tumorigenic breast cancer cell lines. Moreover, 
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re-expression of hCLCA2 in human breast cancer cells abrogates tumorigenicity in 
nude mice, implying that hCLCA2 acts as a tumour suppressor in breast cancer. For 
these reasons, siRNA directed against CLCA family members and related channels 
may prove to be valuable in research and therapeutic venues. ' 

5 

Transporters Involved in Synaptic Transmission. 

Synaptic transmission involves the release of a neurotransmitter into the 
synaptic cleft, interaction of that transmitter with a postsynaptic receptor, and 
subsequent removal of the transmitter from the cleft. Li most synapses the signal is 
1 0 terminated by a rapid reaccumulation of the neurotransmitter into presynaptic 

terrninals. This process is catalyzed by specific neurotransmitter transporters that are 
often energized by the electrochemical gradient of sodium across the plasma 
membrane of the presynaptic cells. 

1 5 Aminobutyric acid (GAB A) is the major inhibitory neurotransmitter in the 

central nervous system. The inhibitory action of GABA, mediated through GABA A / 
GABAb receptors, and is regulated by GABA transporters (GATs), integral 
membrane proteins located perisynaptically on neurons and glia. So far four different 
carriers (GAT1-GAT4) have been cloned and their cellular distribution has been 

20 partly worked out. Comparative sequence analysis has revealed that GABA 

transporters are related to several other proteins involved in neurotransmitter uptake 
including gamma-aminobutyric acid transporters, monoamine transporters, amino 
acid transporters, certain "orphan" transporters, and the recently discovered bacterial 
transporters. Each of these proteins has a similar 12 transmembrane helices topology 

25 and relies upon the Na+/Cl- gradient for transport function. Transport rates are 

dependent on substrate concentrations, with half-maximal effective concentrations for 
transport frequently occurring in the submicromolar to low micromolar range. In 
addition, transporter function is bidirectional, and non-vesicular efflux of transmitter 
may contribute to ambient extracellular transmitter levels. 



30 



Recent evidence suggests that GABA transporters, and neurotransmitter 
transporters in general, are not passive players in regulating neuronal signaling; rather, 
transporter function can be altered by a variety of initiating factors and signal 
transduction cascades. In general, this functional regulation occurs in two ways, 
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either by changing the rate of transmitter flux through the transporter or by changing 
the number of functional transporters on the plasma membrane. A recurring theme in 
transporter regulation is the rapid redistribution of the transporter protein between 
intracellular locations and the cell surface. In general, this functional modulation 
occurs in part through activation of second messengers such as kinases, phosphatases, 
arachidonic acid, and pH. However, the mechanisms underlying transporter 
phosphorylation and transporter redistribution have yet to be fully elucidated. 



GABA transporters play a pathophysiological role in a number of human 
1 0 diseases including temporal lobe epilepsy and are the targets of pharmacological 

interventions. Studies in seizure sensitive animals show some (but not all) of the GAT 
transporters have altered levels of expression at times prior to and post seizure, 
suggesting this class of transporter may affect epileptogenesis, and that alterations 
following seizure may be compensatory responses to modulate seizure activity. For 
1 5 these reasons, siRNAs directed against members of this family of genes (including but 
not limited to SLCG6A1-12) may prove to be valuable research and therapeutic tools. 

Organic Ion Transporters. 

The human body is continuously exposed to a great variety of xenobiotics, via 
20 food, drugs, occupation, and environment. Excretory organs such as kidney, liver, and 
intestine defend the body against the potentially harmful effects of these compounds 
by transforming them into less active metabolites that are subsequently secreted from 
the system. 

25 Carrier-mediated transport of xenobiotics and their metabolites exist for the 

active secretion of organic anions and cations. Both systems are characterized by a 
high clearance capacity and tremendous diversity of substances accepted, properties 
that result from the existance of multiple transporters with overlapping substrate 
specificities. The class of organic anion transporters plays a critical role in the 

50 elimination of a large number of drugs (e.g. , antibiotics, chemotherapeutics, diuretics, 
nonsteroidal anti -inflammatory drugs, radiocontrast agents, cytostatics); drug 
metabolites (especially conjugation products with glutathione, glucuronide, glycine, 
sulfate, acetate); and toxicants and their metabolites (e.g. , mycotoxins, herbicides, 



WO 2004/045543 PCT/US2003/036787 

86 

plasticizers, glutathione S-conjugates of polyhaloalkanes,polyhaloalkenes, 
hydroquinones, aminophenols), many of which are specifically harmful to the kidney. 

Over the past couple of years the number of identified anion transporting 
5 molecules has grown tremendously. Uptake of organic anions (OA") across the 
basolateral membrane is mediated by the classic sodium-dependent organic anion 
transport system, which includes a-ketoglutarate (a-KG 2 ")/OA" exchange via the 
organic anion transporter (OAT1) and sodium-ketoglutarate cotransport via the 
NaVdicarboxylate cotransporter (SDCT2). The organic anion transporting polypetide, 
10 Oatp 1 , and the kidney-specific OAT-K1 and OAT-K2 are seen as potential molecules 
that mediate facilitated OA - efflux but could also be involved in reabsorption via an 
exchange mechanism. Lastly the PEPT1 and PEPT2 mediate luminal uptake of 
peptide drugs, whereas CNT1 and CNT2 are involved in reabsorption of nucleosides 

1 5 The organic anion-transporting polypeptide 1 (Oatpl ) is a Na + - and ATP- 

independent transporter originally cloned from rat liver. The tissue distribution and 
transport properties of the Oatpl gene product are complex. Oatpl is localized to the 
basolateral membrane of hepatocytes, and is found on the apical membrane of S3 
proximal tubules. Studies with transiently transfected cells (e.g. HeLa cells) have 

20 indicated that Oatpl mediates transport of a variety of molecules including 

taurocholate, estrone-3 -sulfate, aldosterone, Cortisol, and others. The observed uptake 
of taurocholate by Oatpl expressed inX laevis oocytes is accompanied by efflux of 
GSH, suggesting that transport by this molecule may be glutathione dependent. 



25 



30 



Computer modeling suggests that members of the Oatp family are highly 
conserved, hydrophobic, and have 12 transmembrane domains. Decreases in 
expression of Oatp family members have been associated with cholestatic liver 
diseases and human hepatoblastomas, making this family of proteins of key interest to 
researchers and the medical community. For these reasons, siRNAs directed against 
OAT family members (including but riot limited to SLC21A2, 3, 6, 8, 9, 11, 12, 14, 
15, and related transporters) are potentially useful as research and therapeutic tools. 



Nucleoside transporters 
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Nucleoside transporters play key roles in physiology and pharmacology. 
Uptake of exogenous nucleosides is a critical first step of nucleotide synthesis in 
tissues such as bone marrow and intestinal epithelium and certain parasitic organisms 
that lack de novo pathways for purine biosynthesis. Nucleoside transporters also 
5 control the extracellular concentration of adenosine in the vicinity of its cell surface 
receptors and regulate processes such as neurotransmission and cardiovascular 
activity. Adenosine itself is used clinically to treat cardiac arrhythmias, and nucleoside 
transport inhibitors such as dipyridamole, dilazep, and draflazine function as coronary 
vasodilators. 

In mammals, plasma membrane transport of nucleosides is brought about by 
members of the concentrative, Na + -dependent (CNT) and equilibrative, Na + - 
independent (ENT) nucleoside transporter families. CNTs are expressed in a tissue- 
specific fashion; ENTs are present inmost, possibly all, cell types and are responsible 

1 5 for the movement of hydrophilic nucleosides and nucleoside analogs down their 
concentration gradients. In addition, structure/function studies of ENT family 
members have predicted these molecules to contain eleven transmembrane helical 
segments with an amino terminus that is intracellular and a carboxyl terminus that is 
extracellular. The proteins have a large glycosylated loop between TMs 1 and 2 and 

20 a large cytoplasmic loop between TMs 6 and 7. Recent investigations have implicated 
the TM 3-6 region as playing a central role in solute recognition. The medical 
importance of the ENT family of proteins is broad. In humans adenosine exerts a 
range ofxardioprotective effects and inhibitors of ENTs are seen as being valuable in 
alleviating a variety of cardio/cardiovascular ailments. In addition, responses to 

25 nucleoside analog drugs has been observed to vary considerably amongst e.g. cancer 
patients. While some forms of drug resistance have been shown to be tied to the up- 
regulation of ABC-transporters (e.g, MDR1), resistance may also be the result of 
reduced drug uptake (i.e. reduced ENT expression). Thus, a clearer understanding of 
ENT transporters may aid in optimizing drug treatments for patients suffering a wide 

30 range of malignancies. For these reasons, siRNAs directed against this class of 

molecules (including SLC28A1-3, SLC29A1-4, and related molecules) may be useful 
as therapeutic and research tools. 



Sulfate Transporters. 
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All cells require inorganic sulfate for normal function. Sulfate is the fourth 
most abundant anion in human plasma and is the major source of sulfur in many 
organisms. Sulfation of extracellular matrix proteins is critical for maintaining 
normal cartilage metabolism and sulfate is an important constituent of myelin 
5 membranes found in the brain 



Because sulfate is a hydrophilic anion that cannot passively cross the lipid 
bilayer of cell membranes, all cells require a mechanism for sulfate influx and efflux 
to ensure an optimal supply. To date, a variety of sulfate transporters have been 

10 identified in tissues from many origins. These include the renal sulfate transporters 
(NaSi-1 and Sat-1), the ubiquitously expressed diastrophic dysplasia sulfate 
transporter (DTDST), the intestinal sulfate transporter (DRA), and the erythrocyte 
anion exchanger (AE1). Most, if not all, of these molecules contain the classic 12 
transmembrane spanning domain architecture commonly found amongst members of 

15 the anion transporter superfamily. 

Recently three different sulfate transporters have been associated with specific 
human genetic diseases. Family members SLC26A2, SLC26A3, and SLC26A4 have 
been recognized as the disease genes mutated in diastrophic dysplasia, congenital 

20 chloride diarrhea (CLD), and Pendred syndrome (PDS), respectively? DTDST is a 

particularly complex disorder. The gene encoding this molecule maps to chromosome 
5q, and encodes two distinct transcripts due to alternative exon usage. In contrast to 
other sulfate .transppEters {e.g. Sat-1) anion movement by the DTDST protein is 
markedly inhibited by either extracellular chloride or bicarbonate. Impaired function 

25 of the DTDST gene product leads to undersulfation of proteoglycans and a complex 
family of recessively inherited osteochondrodysplasias (achondrogenesis type IB, 
atelosteogenesis type II, and diastrophic dysplasia) with clinical features including but 
not limited to, dwarfism, spinal deformation, and specific joint abnormalities. 
Interestingly, while epidemiological studies have shown that the disease occurs in 

30 most populations, it is particularly prevalent in Finland owing to an apparent founder 
effect. For these reasons, siRNAs directed against this class of genes (including but 
not limited to SLC26A1 -9, and related molecules) may be potentially helpful in both 
therapeutic and research venues. 
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Ion Exchangers 

Intracellular pH regulatory mechanisms are critical for the maintenance of 
countless cellular processes. For instance, in muscle cells, contractile processes and 
metabolic reactions are influenced by pH. During periods of increased energy 
5 demands and ischemia, muscle cells produce large amounts of lactic acid that, without 
quick and efficient disposal, would lead to acidification of the sarcoplasm. 

Several different transport mechanisms have evolved to maintain a relatively 
constant intracellular pH. The relative contribution of each of these processes varies 

10 with cell type, the metabolic requirements of the cell, and the local environmental 
conditions. Intracellular pH regulatory processes that have been characterized 
functionally include but are not limited to the NaVH* exchange, the Na(HC0 3 )„ 
cotransport, and the Na + -dependent and -independent CP/base exchangers. As 
bicarbonate and CO z comprise the major pH buffer of biological fluids, sodium 

1 5 biocarbonate cotransporters (NBCs) are critical. Studies have shown that these 

molecules exist in numerous tissues including the kidney, brain, liver, cornea, heart, 
and lung, suggesting that NBCs play an important role in mediating HC0 3 ~ transport 
in both epithelial as well as nonepithelial cells. 

20 Recent molecular cloning experiments have identified the existence of four 

NBC isoforms (NBC1, 2, 3 and 4) and two NBC-related proteins, AE4 and NCBE 
(Anion Exchanger 4 and Na-dependent Chloride-Bicarbonate Exchanger). The 
secondary structure^Hatyses and hydropathy profile of this family predict them to be 
intrinsic membrane proteins with 12 putative transmembrane domains and several 

25 family members exhibit iV-linked glycosylate sites, protein kinases A and C, casein 
kinase II, and ATP/GTP-binding consensus phosphorylation sites, as well as potential 
sites for myristylation and amidation. AE4 is a relatively recent addition to this 
family of proteins and shows between 30-48% homology with the other family 
members. When expressed in COS-7 cells and Xenopus oocytes AE4 exhibits sodium- 

30 independent and DIDS-insensitive anion exchanger activity. Exchangers have been 
shown to be responsible for a variety of human diseases. For instance, mutations in 
three genes of the anion transporter family (SLC) are believed to cause known 
hereditary diseases, including chondrodysplasia (SLC26A2, DTD), diarrhea (A3, 
down-regulated in adenoma/chloride-losing diarrhea protein: DRA/CLD), and 
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goiter/deaftiess syndrome (A4, pendrin). Moreover, mutations in Na+/HC03 co- 
transporters have also been associated with various human maladies. For these 
reasons, siKNAs directed against these sorts of genes (e.g. SLC4A4-10, and related 
genes) may be useful for therapeutic and research purposes. 

5 

Receptors Involved in Synaptic Transmission 

m all vertebrates, fast inhibitory synaptic transmission is the result of the 
interaction between the neurotransmitters glycine (Gly) and y-aminobutyric acid 
(GABA) and their respective receptors. The strychnine-sensitive glycine receptor is 
1 0 especially important in that it acts in the mammalian spinal cord and brain stem and 
has a well-established role in the regulation of locomotor behavior. 

Glycine receptors display significant sequence homology to several other 
receptors including the nicotinic acetylcholine receptor, the aminobutyric acid 

1 5 receptor type A (GABA A R), and the serotonin receptor type 3 (5-HT 3 R) subunits. As 
members of the superfamily of ligand-gated ion channels, these polypeptides share 
common topological features. The glycine receptor is composed of two types of 
glycosylated integral membrane proteins (al-a4 and P) arranged in a pentameric 
suprastructure. The alpha subunit encodes a large extracellular, N-terminal domain 

20 that carries the structural determinants essential for agonist and antagonist-binding, 

followed by four transmembrane spanning regions (TM1-TM4), with TM2 playing the 
critical role of forming the inner wall of the chloride channel. 



The density, location, and subunit composition of glycine neurotransmitter 
25 receptors changes over the course of development. It has been observed that the 

amount of GlyR gene translation (assessed by the injection of developing rat cerebral 
cortex mRNA into Xenopus oocytes) decreases with age, whereas that of GABARs 
increases. In addition, the type and location ofmRNAs coding for GlyR changes over 
the course of development. For instance in a study of the expression of alpha 1 and 
30 alpha 2 subunits in the rat, it was observed that (in embryonic periods El 1-1 8) the 
mantle zone was scarce in the alpha 1 mRNA, but the germinal zone (matrix layer) at 
El 1-14 expressed higher levels of the message. At postnatal day 0 (P0), the alpha 1 
signals became manifested throughout the gray matter of the spinal cord. By contrast, 
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the spinal tissues at PO exhibited the highest levels of alpha 2 mRNA, which 
decreased with the postnatal development. 

In both, man and mouse mutant lines, mutations of GlyR subunit genes result 
5 in hereditary motor disorders characterized by exaggerated startle responses and 
increased muscle tone. Pathological alleles of the Glral gene are associated with the 
murine phenotypes oscillator (spd 01 ) and spasmodic (spd). Similarly, a mutant allele 
of Glrb has been found to underly the molecular pathology of the spastic mouse (spa). 
Resembling the situation in the mouse, a variety of GLRA1 mutant alleles have been 
1 0 shown to be associated with the human neurological disorder hyperekplexia or startle 
disease. For these reasons, siRNA directed against glycine receptors (GLRA1-3, 
GLRB, and related molecules), glutamate receptors, GABA receptors, ATP receptors, 
and related neurotransmitter receptor molecules may be valuable therapeutic and 
research reagents. 

15 

Proteases 

i 

Kallikreins 

One important class of proteases are the kallikreins, serine endopeptidases that 
20 split peptide substrates preferentially on the C-terminal side of internal arginyl and 

* 

lysyl residues. Kallikreins are generally divided into two distinct groups, plasma 
kallikreins and tissue kallikreins. Tissue kallikreins represent a large group of 
enzymes that have substantial similarities at both the gene and protein level. The 
genes encoding this group are frequently found on a single chromosome, are 
25 organized in clusters, and are expressed in a broad range of tissues {e.g. pancreas, 
ovaries, breast). In contrast, the plasma form of the enzyme is encoded by a single 
gene {e.g. KLK3) that has been localized to chromosome 4q34-35 in humans. The 
gene encoding plasma kallikrein is expressed solely in the liver, contains 15 exons, 
and encodes a glycoprotein that is translated as a preprotein called prekallikrein. 

30 

Kallikreins are believed to play an important role in a host of physiological 
events For instance, the immediate consequence of plasma prekallikrein activation is 
the cleavage of high molecular weight kininogen (HK) and the subsequent liberation 
of bradykinin, a nine amino acid vasoactive peptide that is an important mediator of 
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inflammatory responses. Similarly, plasma kallikrein promotes single-chain urokinase 
activation and subsequent plasminogen activation, events that are critical to blood 
coaggulation and wound healing. 



5 Disruptions in the function of kallikreins have been implicated in a variety of 

pathological processes including imbalances in renal function and inflammatory 
processes. For these reasons, siKNAs directed against this class of genes (e.g. KLK1- 
1 5) may prove valuable in both research and therapeutic settings. 

10 ADAM Proteins 

The process of fertilization takes place in a series of discrete steps whereby the 
sperm interacts withri) the cumulus cells and the hyaluronic acid extracellular matrix 
(ECM) in which they are embedded, ii) the egg's own ECM, called the zona pellucida 
(ZP), and iii) the egg plasma membrane. During the course of these interactions, the 

1 5 "acrosome reaction," the exocytosis of the acrosome vesicle on the head of the sperm, 
is induced, allowing the sperm to penetrate the ZP and gain access to the perivitelline 
space. This process exposes new portions of the sperm membrane, including the inner 
acrosomal membrane and the equatorial segment, regions of the sperm head that can 
participate in initial gamete membrane binding. 

20 

The interactions of the gamete plasma membranes appear to involve multiple 
ligands and receptors and are frequently compared to leukocyte-endothelial 
interactions. These interactions-lead* to a series of signal transduction events in the 
egg, known as collectively as egg activation and include the initiation of oscillations 
25 in intracellular calcium concentration, the exit from meiosis, the entry into the first 
embryonic mitosis, and the formation of a block to polyspermy via the release of ZP- 
modifying enzymes from the egg's cortical granules. Ultimately, sperm and egg not 
only adhere to each other but also go on to undergo membrane fusion, making one 
cell (the zygote) from two. 

30 

Studies on the process of sperm-egg interactions have identified a number of 
proteins that are crucial for fertilization. One class of proteins, called the ADAM 
family (A Disintegrin And Metalloprotease), has been found to be important in 
spermatogenesis and fertilization, as well as various developmental systems including 
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myogenesis and neurogenesis. Members of the family contain a disintegrin and 
metalloprotease domain (and therefore have (potentially) both cell adhesion and 
protease activities), as well as cysteine-rich regions, epidermal growth factor (EGF> 
like domains, a transmembrane region, and a cytoplasmic tail. Currently, the ADAM 
gene family has 29 members and constituents are widely distributed in many tissues 
including the brain, testis, epididymis, ovary, breast, placenta, liver, heart, lung, bone, 
and muscle. 



One of the best-studied members of the ADAM family is fertilin, a 
1 0 heterodimeric protein comprised of at least two subunits, fertilin alpha and fertilin 
beta. The fertilin beta gene (ADAM2) has been disrupted with a targeting gene 
construct corresponding^ the exon encoding the fertilin beta, disintegrin domain. 
Sperm from males homozygous for disruptions in this region exhibit defects in 
multiple facets of sperm function including reduced levels of s^erm transit from the 
15 uterus to the oviduct, reduced sperm-ZP binding, and reduced s^erm-egg binding, all 
of which contribute to male infertility. 

Recently, four new ADAM family members (ADAM 24-27) have bfeen 
isolated. The deduced amino acid sequences show that all four contain the complete 
.20 domain organization common to ADAM family members and Northern Blot analysis v 
has shown all four to be specific to the testes. SiRNAs directed against this class of 
genes {e.g. ADAM2 and related proteins) may be useful as research tools and 
therapeutics directed toward fertility .and .birth control. 

25 Aminopeptidases 

Aminopeptidases are proteases that play critical roles in processes such as 
protein maturation, protein digestion in its terminal stage, regulation of hormone 
levels, selective or homeostatic protein turnover, and plasmid stabilization. These 
enzymes generally have broad substrate specificity, occur in several forms and play a 

30 major role in physiological homeostasis. For instance, the effects of bradykinin, 

angiotensin converting enzyme (ACE), and other vasoactive molecules are muted by 
one of several peptidases that cleave the molecule at ah internal position and eliminate 
its ability to bind its cognate receptor {e.g. for bradykinin, the B2-receptor). 
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Among the enzymes that can cleave bradykinin is the membrane bound 
aminopeptidase P, also referred to as aminoacylproline aminopeptidase, proline 
aminopeptidase; X-Pro aminopeptidase (eukaryote) and XPNPEP2. Aminopeptidase 
P is an aminoacylproline aminopeptidase specific for NH 2 -terminal Xaa-proline 
5 bonds. The enzyme i) is a mono-zinc-containing molecule that lacks any of the typical 
metal binding motifs found in other zinc metalloproteases, ii) has an active-site 
configuration similar to that of other members of the MG peptidase family, and iii) is 
present in a variety of tissues including but not limited to the lung, kidney, brain, and 
intestine. 

10 

Aminopeptidases play an important role in a diverse set of human diseases. 
Low plasma concentrations-of aminopeptidase P are a potential predisposing factor 
for development of angio-oedema in patients treated with ACE inhibitors, and 
inhibitors of aminopeptidase P may act as cardioprotectors against other forms of 
1 5 illness including, but not limited to myocardial infarction. For these reasons, siRNAs 
directed against this family of proteins (including but not limited to XPNPEP1 and 
related proteins) may be useful as research and therapeutic tools. 



Serine Proteases 

20 One important class of proteases are the serine proteases. Serine proteases 

share a common catalytic triad of three amino acids in their active site (serine 
(nucleophile), aspartate (electrophile), and histidine (base)) and can hydrolyze either 
esters or peptide bonds utilizing mechanisms-of covalent catalysis and preferential 
binding of the transition state. Based on the position of their introns serine proteases 

25 have been classified into a minimum of four groups including those in which 1) the 
gene has no introns interrupting the exon coding for the catalytic triad (e.g. the 
haptoglobin gene,); 2) each gene contains an intron just downstream from the codon 
for the histidine residue at the active site, a second intron downstream from the exon 
containing the aspartic acid residue of the active site and a third intron just upstream 

30 from the exon containing the serine of the active site (e.g. trypsinogen, 

chymotrypsinogen, kallikrein and proelastase); 3) the genes contain seven introns 
interrupting the exons coding the catalytic region (e.g. complement factor B gene); 
and 4) the genes contain two introns resulting in a large exon that contains both the 
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active site aspartatic acid and serine residues (e.g. factor X, factor IX and protein C 
genes). 



Cytotoxic lymphocytes (e.g. CD8(+) cytotoxic T cells and natural killer cells) 
5 form the major defense of higher organisms against virus-infected and transformed 
cells. A key function of these cells is to detect and eliminate potentially harmful cells 
by inducing them to undergo apoptosis. This is achieved through two principal 
pathways, both of which require direct but transient contact between the killer cell and 
its target. The first pathway involves ligation of TNF receptor-like molecules such as 

1 0 Fas/CD95 to their cognate ligands, and results in mobilization of conventional, 
programmed cell-death pathways centered on activation of pro-apoptotic caspases. 
The second mechanism consists*of a pathway whereby the toxic contents of a 
specialized class of secretory vesicles are introduced into the target cell. Studies over 
the last two decades have identified the toxic components as Granzymes, a family of 

1 5 serine proteases that are expressed exclusively by cytotoxic T lymphocytes and 
natural killer (NK) cells. These agents are stored in specialized lytic granules and 
enter the target cell via endocytosis. Like caspases, cysteine proteases that play an 
important role in apoptosis, granzymes can cleave proteins after acidic residues, 
especially aspartic acid, and induce apoptosis in the recipient cell. 

20 - ; 

Granzymes have been grouped into three subfamilies according to substrate 
specificity. Members of the granzyme family that have enzymatic activity similar to 
the serine protease chymotrypsin are encode<kby~a-gene cluster termed the r chymase 
locus 1 . Similarly, granzymes with trypsin-like specificities are encoded by the 
25 'tryptase locus 1 , and a third subfamily cleaves after unbranched hydrophobic residues, 
especially methionine, and are encoded by the 'Met-ase locus 1 . All granzymes are 
synthesized as zymogens and, after clipping of the leader peptide, obtain maximal 
enzymatic activity subsequent to the removal of an amino-terminal dipeptide. 

30 Granzymes have been found to be important in a number of important 

biological functions including defense against intracellular pathogens, graft versus 
host reactions, the susceptibility to transplantable and spontaneous malignancies, 
lymphoid homeostasis, and the tendency toward auto-immune diseases. For these 
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reasons, siRNAs directed against granszymes (e.g. GZMA, GZMB, GZMH, GZHK, 
GZMM) and related serine proteases may be useful research and therapeutic reagents. 

Kinases 

Protein Kinases (PKs) have been implicated in a number of biological 
processes. Kinase molecules play a central role in modulating cellular physiology and 
developmental decisions, and have been implicated in a large list of human maladies 
including cancer, diabetes, and others. 

During the course of the last three decades, over a hundred distinct protein 
kinases have been identified, all with presumed specific cellular functions. A few of 
these enzymes have been isolated to sufficient purity to perform in vitro studies, but 
most remain intractable due to the low abundance of these molecules in the cell. To 
counter this technical difficulty, a number of protein kinases have been isolated by 
molecular cloning strategies that utilize the conserved sequences of the catalytic 
domain to isolate closely related homologs. Alternatively, some kinases have been 
purified (and subsequently studied) based on their interactions with other molecules. 

p58 is a member of the P 34cdc2-related supergene family and contains a large 
domain that is highly homologous to the cell division control kinase, cdc2. This new 
cell division control-related protein kinase was originally identified as a component of 
semipurified galactosyltransferase; thus, it has been denoted galactosyltransferase- 
associated protein kinase (GTA-kinase). GTAJsmaseJias been found to be expressed 
in both adult and embryonic tissues and is known to phosphorylate a number of 
substrates, including histone HI, and casein. Interestingly enough, over expression of 
this molecule in CHO cells has shown that elevated levels of p58 result in a prolonged 
late telophase and an early Gl phase, thus hinting of an important role for GTA- 
kinase in cell cycle regulation. 

Cvclin Dependent Kinases 

The cyclin-dependent kinases (Cdks) are a family of highly conserved 
serine/threonine kinases that mediate many of the cell cycle transitions that occur 
during duplication. Each of these Cdk catalytic subunits associates with a specific 
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subset of regulatory subunits, termed cyclins, to produce a distinct Cdk-cyclin kinase 
complex that, in general, functions to execute a unique cell cycle event. 

Activation of the Cdk-cyclin kinases during cellular transitions is controlled by 
5 a variety of regulatory mechanisms. For the Cdc2-cyclin B complex, inhibition of 
kinase activity during S phase and G 2 is accomplished by phosphorylation of two 
Cdc2 residues, Thr 14 and Tyr 15 , which are positioned within the ATP-binding cleft. 
Phosphorylation of Thr 14 and/or Tyr 15 suppresses the catalytic activity of the molecule 
by disrupting the orientation of the ATP present within this cleft. In contrast, the 

1 0 abrupt dephosphorylation of these residues by the Cdc25 phosphatase results in the 
rapid activation of Cdc2-cyclin B kinase activity and subsequent downstream mitotic 
events. While the exact details of this pathway have yet to be elucidated, it has been 
proposed that Thr 14 /Tyr 15 phosphorylation functions to permit a cell to attain a critical 
concentration of inactive Cdk-cyclin complexes, which, upon activation, induces a 

1 5 rapid and complete cell cycle transition. Furthermore, there is evidence in 
mammalian cells that Thr 14 /Tyr 15 phosphorylation also functions to delay Cdk 
activation after DNA damage. 

The Schizosaccharomyces pombe weel gene product was the first kinase 
20 identified that is capable of phosphorylating Tyr 15 in Cdc2. Homologs of the Weel 
kinase have been subsequently identified and biochemically characterized from a wide 
range of species including human, mouse, frog, Saccharomyces cerevisiae, and 
Drosophila. In vertebrate systems, where Thr 14 dikGde2ds.also phosphorylated, the 
Weel kinase was capable of phosphorylating Cdc2 on Tyr 15 , but not Thr 14 , indicating 
25 that another kinase was responsible for Thr 14 phosphorylation. This gene, Mytl 
kinase, was recently isolated from the membrane fractions of Xenopus egg extracts 
and has been shown to be capable of phosphorylating Thr 14 and, to a lessor extent, 
Tyr 15 in Cdc2. A human Mytl homolog displaying similar properties has been 
isolated, as well as a non-membrane-associated molecule with Thr 14 kinase activity. 



30 



In the past decade it has been shown that cancer can originate from 
overexpression of positive regulators, such as cyclins, or from underexpression of 
negative regulators {e.g. pl6 (INK4a), pl5 (INK4b), p21 (Cipl)). Inhibitors such as 
Mytl are the focus of much cancer research because they are capable of controlling 
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cell cycle proliferation, now considered the Holy Grail for cancer treatment. For these 
reasons, siRNA directed against kinases and kinase inhibitors including but not 
limited to ABL1, ABL2, ACK1, ALK, AXL, BLK, BMX, BTK, C20orf64, CSF1R, 
SCK, DDR1, DDR2, DKFZp761P1010, EGFR, EPHA1, EPHA2, EPHA3, EPHA4, 

5 EPHA7, EPHA8, EPHB1, EPHB2, EPHB3, EPHB4, EPHB6, ERBB2, ERBB3, 
ERBB4, FER, FES, FGFR1, FGFR2, FGFR3, FGFR4, FGR, FLT1, FLT3, FLT4, 
FRK, FYN, HCK, IGF1R, INSR, ITK, JAK1, JAK2, JAK3, KDR, KIAA1079, KIT, 
LCK, LTK, LYN, MATK, MERTK, MET, MST1R, MUSK, NTRK1 , NTRK2, 
NTRK3, PDGFRA, PDGFRB, PTK2, PTK2B, PTK6, PTK7, PTK9, PTK9L, RET, 

1 0 ROR1 , ROR2, ROS 1 , RYK, SRC, SYK, TEC, TEK, TIE, TNK1 , TXK, TYK2, 
TYR03, YES1, and related proteins, maybe useful for research and therapeutic 
purposes. 

G Protein Coupled Receptors 

15 

One important class of genes to which siRNAs can be directed are G-protein 
coupled receptors (GPCRs). GPCRs constitute a superfamily of seven transmembrane 
spanning proteins that respond to a diverse array of sensory and chemical stimuli, 
such as light, odor, taste, pheromones, hormones and neurotransmitters. GPCRs play a 
20 central role in cell proliferation, differentiation, and have been implicated in the 

% * 

etiology of disease. 

The mechanism by which G protein-coupled receptors translate extracellular 
signals into cellular changes was initially envisioned as a simple linear model: 

25 activation of the receptor by agonist binding leads to dissociation of the heterotrimeric 
GTP-binding G protein (Gs, Gi, or Gq) into its alpha and beta/gamma subunits, both 
of which can activate or inhibit various downstream effector molecules. More 
specifically, activation of the GPCR induces a conformational change in the Ga 
subunit, causing GDP to be released and GTP to be bound in its place. The Ga and 

30 Gpy subunits then dissociate from the receptor and interact with a variety of effector 
molecules. For instance in the case of the Gs family, the primary function is to 
stimulate the intracellular messenger adenylate cyclase (AC), which catalyzes the 
conversion of cytoplasmic ATP into the secondary messenger cyclic AMP (cAMP). 
In contrast, the Gi family inhibits this pathway and the Gq family activates 
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phospholipases C (PLC), which cleaves phosphatidylinositol 4,5, bisphosphate (PIP2) 
to generate inositol- 1,4,5-phosphate (IP3) and diacylglycerol (DAG). 

More recently, studies have shown that the functions of GPCRs are not limited 
5 to their actions on G-proteins and that considerable cross-talk exists between this 
diverse group of receptor molecules and a second class of membrane bound proteins, 
the receptor tyrosine kinases (RTKs). A number of GPCRs such as endothelin-1 , 
thrombin, bombesin, and dopamine receptors can activate MAPKs, a downstream 
effector of the RTK/Ras pathway. Interestingly, the interaction between these two 
1 0 families is not unidirectional and RTKs can also modulate the activity of signaling 
pathways traditionally thought to be controlled exclusively by ligands that couple to 
GPCRs. For instance, EGF, which normally activates the MAPK cascade via the 
EGF receptor can stimulate adenylate cyclase activity by activating Gas. 

1 5 There are dozens of members of the G Protein-Coupled Receptor family that 

have emerged as prominent drug targets in the last decade. One non-limiting list of 
potential GPCR-siRNA targets is as follows: 

CMKLR1 

20 CML1/ GMKLR1 (Accession No. Q99788) is a member of the chemokine 

receptor family of GPCRs that may play a role in a number of diseases including 
those involved in inflammation and immunological responses {e.g. asthma, arthritis). 
For this reason, siRNA directed against this protein may-proVe-to-be important 
therapeutic reagents. 

25 

Studies of juvenile-onset neuronal ceroid lipofuscinosis (JNCL, Batten 
disease), the most common form of childhood encephalopathy that is characterized by 
progressive neural degeneration, show that it is brought on by mutations in a novel 
lysosomal membrane protein (CLN3). In addition to being implicated in JNCL, 
30 CLN3 (GPCR-like protein, Accession No. A5721 9) expression studies have shown 
that the CLN3 mRNA and protein are highly over-expressed in a number of cancers 
{e.g. glioblastomas, neuroblastomas, as well as cancers of the prostate, ovaries, breast, 
and colon) suggesting a possible contribution of this gene to tumor growth. For this 
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reason, siRNA directed against this protein may prove to be important therapeutic 
reagents. 



CLACR 



The calcitonin receptor (CTR/ CALCR, Accession No. NM 001742) belongs 
to "family B" of GPCRs which typically recognized regulatory peptides such as 
parathyroid hormone, secretin, glucagons and vasoactive intestinal polypeptide. 
Although the CT receptor typically binds to calcitonin (CT), a 32 amino acid peptide 
hormone produced primarily by the thyroid, association of the receptor with RAMP 
(Receptor Activity Modulating Protein) enables it to readily bind other members of 
the calcitonin peptide family including amylin (AMY) and other CT gene-related 
peptides {e.g. aCGRP and PCGRP). While the primary function of the calcitonin 
receptor pertains to regulating osteoclast mediated bone resorption and enhanced Ca +2 
excretion by the kidney, recent studies have shown that CT and CTRs may play an 
important role in a variety of processes as wide ranging as embryonic/foetal 
development and sperm function/physiology. In addition, studies have shown that 
patients with particular CTR genotypes may be at higher risk to lose bone mass and 
that this GPCR may contribute to the formation of calcium oxalate urinary stones. For 
this reason, siRNA directed against CTR may be useful as therapeutic reagents. 



OXTR 



The human oxytocin receptor (OTR, OXTR) is a 389 amino acid polypeptide 
that exhibits the seven transmembrane domain structure and belongs-to the Class-I 
(rhodopsin-type) family of G-protein coupled receptors. OTR is expressed in a wide 
variety of tissues throughout development and mediates physiological changes 
through G(q) proteins and phospholipase C-beta. Studies on the functions of oxytocin 
and the oxytocin receptor have revealed a broad list of duties. OT and OTR play a 
role in a host of sexual, maternal and social behaviors that include egg-laying, birth, 
milk-letdown, feeding, grooming, memory and learning. In addition, it has been 
hypothesized that abnormalities in the functionality of oxytocin-OTR receptor-ligand 
system can lead to a host of irregularities including compulsive behavior, eating 
disorders (such as anorexia), depression, and various forms of neurodegenerative 
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diseases. For these reasons, siRNA directed against this gene (NM_000916) may play 
an important role in combating OTR-associated illnesses. 



EDG GPCRs 

5 Lysophosphatidic acid and other lipid-based hormones/growth factors induce 

their effects by activating signaling pathways through the G-protein coupled receptors 
(GPCRs) and have been observed to play important roles in a number of human 
diseases including cancer, asthma, and vascular pathologies. For instance, during 
studies of immunoglobulin A nephropathy (IgAN), researchers have observed an 
10 enhanced expression of EDG5 (NPJ304221) suggesting a contribution of this gene 
product in the development of IgAN. For that reasons, siRNA directed against Edg5 
(NM_004230), Edg4 (NM__004720), Edg7 (Nm_012152) and related genes may play 
an important role in combating human disease. 

15 Genes Involved in Cholesterol Signaling and Biosynthesis 

Studies on model genetic organisms such as Drosophila and C elegans have 
led to the identification of a plethora of genes that are essential for early development. 
Mutational analysis and ectopic expression studies have allowed many of these genes 
to be grouped into discreet signal transduction pathways and have shown that these 

20 elements play critical roles in pattern formation and cell differentiation. Disruption of 
one or more of these genes during early stages of development frequently leads to 
birth defects whereas as alteration of gene function at later stages in life can result in 
tumorigenesis. 



25 One critical set of interactions known to exist in both invertebrates and 

vertebrates is the Sonic Hedgehog-Patched-Gli pathway. Originally documented as a 
Drosophila segmentation mutant, several labs have recently identified human and 
mouse orthologs of many of the pathways members and have successfully related 
disruptions in these genes to known diseases. Pathway activation is initiated with the 

30 secretion of Sonic hedgehog. There are three closely related members of the Shh 
family (Sonic hedgehog, Desert, and Indian) with Shh being the most widely 
expressed form of the group. The Shh gene product is secreted as a small pro-signal 
molecule. To successfully initiate its developmental role, Shh is first cleaved, 
whereupon the N-terminal truncated fragment is covalently modified with cholesterol. 
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The addition of the sterol moiety promotes the interaction between Shh and its 
cognate membrane bound receptor, Patched (Ptch). There are at least two isoforms of 
the Patched gene, Ptchl and Ptch2. Both isoforms contain a sterol-sensing domain 
(SSD); a roughly 1 80 amino acid cluster that is found in at least seven different 
classes of molecules including those involved in cholesterol biosynthesis, vesicular 
traffic, signal transduction, cholesterol transport, and sterol homeostasis In the 
absence of Shh, the Patched protein is a negative regulator of the pathway. In contrast 
bmdmg of Shh-cholesterol to the Patched receptor releases the negative inhibition 
which that molecule enforces on a G-protein coupled receptor known as Smoothened 
Subsequent activation of Smoothened (directly or indirectly) leads to the triggering of 
a trio of transcription factors that belong to the Gli family. All three factors are 
relatively large, contain a characteristic C2-H2 zmc-fmgerpfcntamer, and recognize 
one of two consensus sequences (SEQ. ID NO. 0463 GACCACCCA or SEQ ID NO 
0464 GAACCACCCA). m the absence of Shh, Gli proteins are cleaved by the 
proteosome and the C-terminally truncated fragment translocates to the nucleus and 
acts as a dominant transcription repressor. In the presence of Shh-cholesterol Gli 
repressor formation is inhibited and full-length Gli functions as a transcriptional 
activator. 

Shh and other members of the Shh-PTCH-Gli pathway are expressed in a 
broad range of tissues {e.g. the notochord, the floorplate of the neural tube, the brain 
and the gut) at early stages in development. Not surprisingly, mutations that lead to ' 
altered protein expression or function have been shown to induce developmental 
abnormalities. Defects in the human Shh gene have been shown to cause 
holoprosencephaly, a midline defect that manifests itself as cleft lip or palate, CNS 
septation, and a wide range of other phenotypes. Interestingly, defects in cholesterol 
busynthesis generate similar Shh-like disorders (e.g. Smith-Lemli-Opitz syndrome) 
suggesting that cholesterol modification of the Shh gene product is crucial for 
pathway function. Both the Patched and Smoothened genes have also been shown to 
be clinically relevant with Smoothened now being recognized as an oncogene that 
hke PTCH-1 and PTCH-2, is believed to be the causative agent of several forms of 
adult tumors. For these reasons, siRNA directed against Smoothened (SMO 
NM_00563 1), Patched (PTCH, nm_000264), and additional genes that participate in 
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cholesterol signaling, biosynthesis, and degradation, have potentially useful research 
and therapeutic applications. 

Tar geted Pathways . 

5 In addition to targeting siRNA against one or more members of a family of 

proteins, siRNA can be directed against members of a pathway. Thus, for instance, 
siRNA can be directed against members of a signal transduction pathway (e.g. the 
insulin pathway, including AKT1-3, CBL, CBLB, EIF4EBP1, FOXOIA, FOX03A, 
FRAP1, GSK3A, GSK3B, IGF1, IGF1R, INPP5JD, INSR, IRS1, MLLT7, PDPK1, 
10 PIK3CA, PIK3CB, PIK3R1, PIK3R2, PPP2R2B, PTEN, RPS6, RPS6KA1, 

RPX6KA3, SGK, TSC1, TSC2, AND XPOl), an apoptotic pathway (CASP3, 6,7,8,9, 
DSH1/2, PI 10, P85, PDK1/2, CATENIN, HSP90, CDC37, P23, BAD, BCLXL, 
BCL2, SMAC, and others), pathways, involved in DNA damage, cell cycle, and other 
physiological (p53,MDM2, CHK1/2, BRCA1/2, ATM, ATR, P15INK4, P27, P21, 
1 5 SKP2, CDC25C/A, 14-3-3, PLK, RB, CDK4, GLUT4, Inos, Mtor, FKBP, PPAR, 
RXR, ER). Similarly, genes involved in immune system function including TNFR1, 
IL-IR, IRAKI/2, TRAF2, TRAF6, TRADD, FADD, IKKs, IKKy, IKK|3, IKKa, 
IkBa, HcBP, p50, p65, Rac, RhoA, Cdc42, ROCK, Pakl/2/3/4/5/6, cIAP, HDAC1/2, 
CBP, p-TrCP, Rip2/4, and others are also important targets for the siRNAs described 
20 in this document and may be useful in treating immune system disorders. Genes 

involved in apoptosis, such as Dshl/2,PTEN 5 PI 10 (pan), P85, PDK1/2, Aktl, Akt2, 
Akt (pan), p70 s6K , GSK3p, PP2A (cat), (J-catenin, HSP90, Cdc37/p50, P23, Bad, 
BclxL, Bcl2, Smac/Diablo, and Askl are potentially useful in the treatment 'oi ^ 
diseases that involve defects in programmed cell death (e.g. cancer), while siRNA 
25 agents directed against p53 5 MDM2, Chkl/2, BRCA1/2, ATM, ATR, pi S 1 ** 4 , P27, 
P21, Skp2, Cdc25C/A, 14-3-3a/e, PLK, Rb, Cdk4, Glut4, iNOS, mTOR, FKBP, 
PPARy, RXRa, ERq and related genes may play a critical role in combating diseases 
associated with disruptions in DNA repair, and cell cycle abnormalities. 

30 Tables VI -Table X below provide examples of useful pools for inhibiting 

different genes in the human insulin pathway and tyrosine kinase pathways, proteins 
involved in the cell cycle, the production of nuclear receptors, and other genes. These 
particular pools are particularly useful in humans, but would be useful in any species 

1 m 
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that generates an appropriately homologous mRNA. Further, within each of the listed 
pools any one sequence maybe used independently but preferably at least two of the 
hsted sequences, more preferably at least three, and most preferably all of the listed 

sequences for a given gene is present. 



Table VI 



Gene 
Name 



AKT1 



AKT1 



AKT1 



AKT1 
AKT2 



AKT2 



Acc# 



Gl 



NM Q05163 1 4885060 



NM 005163 1 4885060 



NM 005163 Uaasnm 



NM 005163 U aasnfin 



AKT2 
AKT2 



NM 001626 6715585 
NM_ 001626 6715585 
IMM_0Q1626 67155S5 



AKT2 NM 001626 67155S5 



AKT3 



AKT3 



AKT3 NM 005465 1 32307164' 



AKT3 NM 005465 393071 R4 



AKT3 



CBL 



NM 005465 32307164" 
NM_005465 [32307164" 



LL 



207 



207 



208 
208 
208 



208 



Duplex# I Sequence 



P-003000-05 GACAAGGACG finHAr atta 



v v \ ^^vnnvjvjnUQVjUUAUA I I A 

D-003000-Q6 GGACAAGGACGfiftP.A r atT 
D-0030U0-Q/ GCTACTTCCTCnTn AA^AA - 

H.nnQnnn nolr»flArt^^ ^ — 



D-003000-08 GACCGCCTCTGCTTTGTCA 



D-003001-05 GTACTTCGATGATGAATTT 
D-003001-OH GCAAAGAGGGCATHAfra A 

D-003001 -07 GGGCTAAAGTGACCATG AA~ 
D-003001-O8 GCAGAATGCCAGCTGATGA 



10000 D-0030 02-05 GGAGTAAArrrr^ rA A^iyp- 



CBL NM 005188 488511 r a^f 



1 0000 D-003 002-06 GACATTAAATTTCCT CnA/T 
10000 D-Qua 002-07 GACCAAAfinr AA Ar Ar^yff 
1 0000 D-UQ30 02-08 GAGGAG AGAATGAATTGTA 



CBL N M 005188 488filiR U^y 



CBLB 



CBLb NM 170662 29366807 868 



CBLb N M 170662 293RRBn7' ««« 



CBLB 



CBLB 



NM 170662 29366807 868 
NM 170662 1 293668071868 



D-003004-05 GACCATACCTC AT A AP A a re 
D-003004-U6 TGAAAGACCTCCArr a atp 

D-0030U4-U/ GATGAAGGCT OP.Af^Trvrr 

n_nn^nnyi no t-at^ a^%^ — . — 



EIF4EBP1 

"^MINmS , 0030 . 05 " n6 ^^TAGCC^Afi aSSTT 

FOXQ1A MM 1104085 200701 ^ M978 D-0030Q5..08 CAATAfin nCAGAAGATAAO 

foxoiaS 

^^^^u ^U15 9257221 2308 D-00 3006-08 GAGGTATH AGTCAGTATAA 



FOXQ3A |NM 00 1455 



FQXQ3A |NM~001455 



FOXQ3A NM~001 455 



FOXQ3A |NM 00 1455 
FRAP1 



FRAP1 NM Q04Q 5ft 



4503738 2309 U-003007-01 HAATa^^a,^^ 
4503738 93na n.n n^nn-7 no L„ ^ ™- - U ' A ' AUO 



4503738 
4503738 



4503738 



FRAP1 NM 00 4Q5S 



FRAP1 NM 00 4Q5ft 



FRAP1 |nM 00405ft 



19924298 



1 9924298 



1 9924298 



1 9924293 



v , Vl unniMU ^ MMW U I A I AUG 

2309 D-003007-Q2 1 TGAAGTCHA^ ap^ A, T r, aj 
2309lD^ 0300^Q3 TGTC AC ACT ATCCTZpy^T 



2309 U-U03UU/ -Q4 TGTTCAATGGGAGCTTGGA~ 



2 t 75 P - 003008 - 05 GAGAAGAAA TGGAAGAAAT 



i ! i ii m m ^^Atio/WA I t-j QAAGAAAT 

M^liiP^P^^S^AG^C^ 



— — — wv^-^i vj nooMivjuu^ i CJAATAATA 

2475 | P-0 030U8-08 I GGTCTG A ACTGAATGAAGA 



SEQ. 
I D NO 
465 
466 



467 
468 



469 
470 



b-003003-05 GGAGACACATTT^n ATTA | 477 
D-0030U3-06 GATCTGACCTGCAAT ^att ^ 



CBL NM 005188 UaSMifi 1 . ^ ■ GACCTG CAATGATT 14 7ft 



D-003004-Ud TATCAGCATTTACGACTTA 4ftZ 



488 



489 



490 



491 



492 



493 
494 



495 



496 



497 



498 



499 



500 
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GSK3A 



GSK3A 1NM 019884 



GSK3A |NM 019884 



GSK3A INM 019884 



GSK3A iNM 019884 



GSK3B 



GSK3B INM 002093 



GSK3B [NM 002093 



GSK3B INM 002093 



11995473 



11995473 



11995473 



11995473 



21361339 



21361339 
21361339 



GSK3B INM 002093 



IGF1 



1GF1 NM 000618 
IGpj INM 000618 



IGF1 



NM 000618 



1GF1 



NM 000618 



IGF1R 



1GF1R INM 000875 



IGF1R INM 000875 



1GF1R INM 000875 



IGF1R INM 000875 



INPP5D 



21361339 



2931 



2931 



2931 



D-003009-05 1 GGACAAAGGTGTTCAAAT C 501 

— ■ I ■ _ . " — . . a a a I CfV"! 



~ D-003009-06 GAACCCAGCTGCCTAA CAA I 502 



2931 



2932 



2932 
2932 



P-003Q09-07 GCGCACAGCTTCTTTGATG 
D-003009-08 GCTCTAGCCTGCTGGAGTA 



503 
504 



505 
506 



2932 



D-003010-05 GAAGAAAGATGAGGTCTAT 
D-00301 0-06 GGACCCAAATGTCAAACTA 
D-00301 0-07 GAAATGAACCCAAACTACA 
D-0030 10-081 GATGAGGTCTATCTTAATC 508 



507 



0-00301 1-05 1 GGAAGTACATTTGAAGAAC 509 



D-00301 1 -06 AGAAGGAAGTACATTTGAA 
D-00301 1-07 CCTCAAGCCTGCCAAGTCA 



1 1 068002 1 3480 



D-00301 1-08 GGTGGATGCTCTTCAGTTC 
D-00301 2-05 CAACGAAGCTTCTGTGATG 



510 
511 
512 

513 



1 1 nfiR002 3480 D-00301 2-06 GGCCAGAAATGGAGAATAA 



1 1 nfia002 3480 D-00301 2-07 GAAGCACCCTTTAAGAATG 



1 1 068002 1 3480 I D-00301 2-08 GCAGACACCTACAACATCA 



1NPP5D INM 005541 



INPP5D I NM 005541 



1NPP5D 
INPP5D 



NM 005541 
NM 005541 



5031 798 



5031 798 



, 3635 
1 3635 



n-0Q301 3-05 1 GGAATTGCGTTTACACTTA 



514 



515 



516 



D-00301 3-06 1 GGAAACTGATCATTAAGAA 



5031798 
5031798 



3635 
3635 



D-00301 3-07 1 CGACAGGGATGAAGTACAA 



D-00301 3-08 1 AAACGCAGCTGCCCATCTA 



517 



518 
519 



520 



INSR 



INSR 



INSR 



D^03014-05 GGAAGACGTTTGAGGATTA 
"5^00301 4-06 GAACAAGGCTCCCGAGAGT 
NM 000208 I4557883 I3643 lM Q30144)7 G GAG AG ACCTTGG AAA I JG 
Snn^ 4557883 3643 D-0 0301 4-08 GGACGG AACC CACCT AT! T 



INSR NM 000208 [4557883 1 3643 
INSR INM 000208 



4557883 
4557883 



3643 
3643 



521 



522 
523 



IRS1 



IRS1 



finaift04 3667 D-0030 1 5-05 AAAGAGGTCTGGCAAGTGA 



IRS1 



IRS1 |NM 00554 4 15031804 |366~7 D - 0 0 301 5-0 7 ^ 0°^^^^ 
ifei MM 00^11 U™ «™ MR7 D-00301 5-08 GTCAG TC li CGTCCAGTA 



IRS1 
MLLT7 



MLLT7 INM 005938 5174578 
MLLT7 |NM 005938 5174578 



D-00301 6-05 GGACTGGACTTCMCTTTG 
D-00301 6-06 CCACGAAGCAGTTCAAAXG 



524 



525 



526 



527 



528 



529 
530 



MLLT7 iNM 00593 * 1 5174578 I4303 D -003 01 6-07 j ^^^ ^^^^^p^^Tp y 
L» nncnoo U^ awr Uffla I D -00301 6-08 1 GAOGAGAGATCGCTAACCA 



MLLT7 I NM 005938 [5174578 
PDPK1 
PDPK1 



mm nn9RH 45 05694 5170 D-00301 7-05 CAAGAGACCTCGTGGAGAft 



531 
532 



533 



PDPK1 (NM 002613 



Afflfififtd 5170 D-00301 7-06 GACCAGAGGCCAAGAATTT 534 



PDPK1 



PDPK1 



PIK3CA 
PIK3CA 



PIK3CA 



P1K3CA 



PIK3CA 



PIK3CB 
P1K3CB 



PIK3CB 



PIK3CB 



P1K3CB 
PIK3R1 



NM 002613 



NM 002613 



NM 006218 



NM 006218 



NM 006218 



4505694 
4505694 



5453891 
5453891 



NM 006218 



NM 006219 



NM 006219 



NM 006219 



NM 006219 



5453891 



5453891 



5453893 



5453893 



5453893 



5453893 



5170 
5170 



D-00301 7-07 GGAAACGAGTATCTTATAT, 
D-00301 7-08 GAGAAGCGACATATCATAA 



535 
536 



5290 I D-0030 1 8-05 GCTATCATCTG AA C AATTA | 537 
5290 D-00301 8-06 GGATAGAGGCCAAATAATA 
5290 I D-00301 8-07 GGACAACTGTTTCATATAG 
550301 8-08 GCCAGTACCTCATGGATT A 

n-n0301 9-05 CGACAAGACTGCCGAG AGA 541 



5290 



5291 



5291 



5291 
5291 



538 
53S 
54C 



D-00301 9-06 1 T CAAGTGTCTCCTAATA TG 54; 

I * ■ I W At 



D-00301 9-07 GGATTCAGTTGGAGTGATT 
D-00301 9-08 TTTCAAGTGTCTCCTAATA^ 



54; 

54' 



10S 
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V ll\0K I 


NM 1oloU4 




cone 

5295 


D-003020-05 


GGAAATATGGC M CTCTGA 


545 


KII\oK1 


Mft AO A Cf\A 


o2455251 


5295 


D-003020-06 


GAAAGACGAGAGACCAATA 


546 


Dll/OD-1 

rlr\oK1 


KIIV/I A OA Cfijl 

NM 1o1oU4 


32455251 


cone 

5295 


D-003020-07 


GTAAAGCATTGTGTCATAA 


C A ~7 

547 


r .l\oK1 


Ml\/I A OA Cf\A 

NIVl lo1oU4 


OO/I CCOEA 

o24oozb1 


cone 

5295 


D-003020-08 


/"> s-\ a *T"/^> A A rOTO AAA A A 

G G ATC AAGTTGTC AAAG AA 


c a a 

548 


PIK3K2 














□ ll/ODO 


NM 005027 


4826907 


5296 


D-003021-05 


GGAAAGGCGGGAACAATAA 


549 


PIK3R2 


NM 005027 


4826907 


5296 


D-003021-06 


G ATG AAG CGT ACTG C AATT 


550 


PIK3R2 


NM 005027 


4826907 


5296 


D-003021-07 


GGACAGCGAATCTCACTAC 


r - jT - jl 

551 




NM 005027 


4826907 


5296 


D-003021-08 


GCAAGATCCGAGACCAGTA 


l~ f~ o 

552 


PPP2R2B 














PPP2R2B 


NM 004576 


4758953 


5521 


D-003022-05 


GAATGCAGCTTAC 1 I 1 CTT 


553 


PPP2R2B 


NM 004576 


4758953 


5521 


D-003022-06 


GACCGAAGCTGACATTATC 


554 


PPP2R2B 


NM 004576 


4758953 


5521 


D-003022-07 


TCGATTACCTGAAGAGTTT 


555 


PPP2R2B 


NM 004576 


4758953 


5521 


D-003022-08 


CCTGAAGAGTTTAGAAATA 


f" t— 

556 


PTEN 














PTEN 


NM 000314 


4506248 


5728 


D-003023-05 


^fcfc ^ 'fc m Jt J< V^ Jk 1 ■ 1 ^W>^^ JI JAMA. At Jj. VWH jrifkk. 

GTGAAGATCTTGACCAATG 


557 


PTEN 


NM 000314 


4506248 


5728 


D-003023-06 


J A**^ m ™ ■ ■ A j a^^ A * >■ Jt j"k Jt AAA T ■« ■ A 

GATCAGCATACACAAATTA 


556 


PTEN 


NM 000314 


4506248 


5728 


D-003023-07 


.Ah j jA^ Jk 1 A J A_^ JL Jh«j_4 - ^^^V-^M jk 

GGCGCTATGTGTATTATTA 


55G 


PTEN 


NM 000314 


4506248 


5728 


D-003023-08 


■ A A A a*"V A*~\ a+*% ■ ' jj - ^ a**X A jf^*k A "f ™ A A 

GTATAGAGCGTGCAGATAA 


56C 


RPS6 














RPS6 


NM 001010 


17158043 


6194 


D-003024-05 


GCCAGAAACTCATTGAAGT 


561 


RPS6 


■V I A ■ jA jT*k J y> J 

NM 001010 


1 71 58043 


6194 


D-003024-06 


GGATATTCCTGGACTGACT 


562 


RPS6 


NM 001010 


17158043 


6194 


D-003024-07 


CCAAGGAGAACTGGAGAAA 


56c 


RPS6 


NM 001010 


17158043 


6194 


D-003024-08 


GCGTATGGCCACAGAAGTT 


564 


W^^. A*k. 1 yr A J 

RPS6KA1 














RPS6KA1 


NM 002953 


20149546 


6195 


D-003025-05 


GATGACACCTTCTACTTTG 


56£ 


RPS6KA1 


NM 002953 


20149546 


6195 


D-003025-06 


GAGAATGGGCTCCTCATGA 


566 


RPS6KA1 


NM 002953 


20149546 


6195 


D-003025-07 


^ ^ Jt Jfe ^AAk ^AA> Jb MA) d<k. ^tfek.^^_ v ^ B ^ k. ja J k. jl jb 

CAAGCGGGATCCTTCAGAA 


567 


RPS6KA1 


kill ^x ^X ^X 

NM 002953 


20149546 


6195 


D-003025-08 


^^■fc ^Hfek Jk ^^^k^ ^ABfe, _^^lk Jk Jfe& Jl^k. v^^V ^^b. Jh ^^^^ Ak Jfe ^^*a. Jk. 

CCACCGGCCTGATGGAAGA 


56£ 


r^x #^ /a, i yf a 

RPS6KA3 














RPS6KA3 


NM 004586 


4759049 


6197 


D-003026-05 


GAAGGGAAGTTGTATCTTA 


56£ 


RPS6KA3 


till gr\ j ^i 

NM 004586 


4759049 


6197 


D-003026-06 


GAAAGTATGTGTATGTAGT 


57C 


RPS6KA3 


NM 004586 


4759049 


6197- ^ 


O-003026-07 


^^^•k ^^^fc. jh ^ABfeh JL ^MAk. jk jdp^kk .j^^ jk Jk Jk ^ IB W Jk Jk 

GGACAGCATCCAAACATTA 


571 


RPS6KA3 


NM 004586 


4759049 


6197 


D-003026-08 


GGAGGTGAATTGCTGGATA 


572 


SGK 














SGK 


NM 005627 


^x ^x 

5032090 


6446 


D-003027-01 


frW^V* Jfel m AAA jJBV V^BH- .^Ak, ^_^k a. rfk A ______ jk ^l^_^__A 

TTAATGGTGGAGAGTTGTT 


57c 


SGK 


NM 005627 


5032090 


6446 1 


D-003027-04 


ATTAACTGGGATGATCTCA 


574 


SGK 


NM 005627 


^ ^X ^X ^X ^X 

25168262 


6446 


D-003027-05 


jjkkktv Jk Jk .^P^. _ _ _ .^tf^ Mf 1 ^. _ Jl ^^^t* jk MAI Ji Jk m 

GAAGAAAGCAATCCTGAAA .. 


~ 57e 


SGK 


NM 005627 


25168262 


A A 

6446 


D-003027-06 


AAA JK 4k. A aP*K. Al> ^^k. _ _ _ * ^^k. _ j«k. «-A- 

AAACACAGCTGAAATGTAC 


57€ 


■ » ■ /— » y"», J 

TSC1 














TSC1 


NM 000368 


24475626 


7248 


D-003028-05 


GAAGATGGCTATTCTGTGT 


577 


TSC1 


NM 000368 


24475626 


7248 


I *k A^ ^X ^\ J*V A*X 

D-003028-06 


- f - » Jk — tjW AAAk Jt M jA^. A jHk — fefAjf j«jk >kk A ^"S. A W W * Jk 

TATGAAGGCTCGAGAGTTA 


576 


TSC1 


NM 000368 


24475626 


7248 


D-003028-07 


CGACACGGCTGATAACTGA 


57^ 


TO OH 


NM 000368 


24475626 


7248 


D-003028-08 


CGGCTGATGTTGTTAAATA 


•aa ytv y» 

58C 


TSC2 














I oU2 


NM 000548 


10938006 


7249 


D-003029-05 


/"** A*k A *TT' 1' A A T*/'^ - f - y™*T — I - A S*t ^* A T~ A 

GCATTAATCTCTTACCATA 


581 


I ot^i 


NM U0Uo4o 


109380UD 


7249 


D-003029-06 


CCAATGTCCTCTTGTCTTT 


58.; 


TOpO 


mm nnn^iift 

INIVI UUUOt-O 


luyocsuuo 




u-uuouzy-u / 


bbAbAUAUA 1 OAuO 1 AO 1 1 


OCX 


TSC2 


NM 000548 


10938006 


7249 


D-003029-08 


TCACCAGGCTCATCAAGAA 


58^ 


XP01 














XP01 


NM 003400 


8051634 


7514 


D-003030-05 


GAAAGTCTCTGTCAAAATA 


58i 


XP01 


NM 003400 


8051634 


7514 


D-003030-06 


GCAATAGGCTCCATTAGTG 


58C 


XP01 


NM 003400 


8051634 


7514 


D-003030-07 


GGAACATGATCAACTTATA 


58"/ 


XP01 


NM 003400 


8051634 


7514 


D-003030-08 


GGATACAGATTCCATAAAT 


586 
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Gene 
Name 


Acc# 


Gl 


L.L 


Duplex # 


Sequence 


SE 
ID 


ABL1 














ABL1 


NM 007313 


6382057 


25 


D-003 100-05 


GGAAATCAGTGACATAGTG 


5 


ABL1 


NM 007313 


6382057 


25 


D-003 100-06 


GGTCCACACTGCAATGTTT 


5 


ABL1 


NM 007313 


6382057 


25 


D-003 100-07 


GAAGGAAATCAGTGACATA 


5 


ABU 


NM 007313 


6382057 


25 


D-003 100-08 


TCACTGAGTTCATGACCTA 


5 


ABL2 














ABL2 


NM 007314 


6382061 


27 


D-0031 01-05 


GAAATGGAGCGAACAGATA 


5 


ABL2 


NM 007314 


6382061 


27 


D-003 1 01-06 


GAGCCAAATTTCCTATTAA 


5 


ABL2 


NM 007314 


6382061 


27 


D-0031 01-07 


GTAATAAGCCTACAGTCTA 


5 


ABL2 


NM 007314 


6382061 


27 


D-0031 01-08 


GGAGTGAAGTTCGCTCTAA 


5 


ACK1 














ACK1 


NM 005781 


8922074 


10188 


D-0031 02-05 


AAACGCAAGTCGTGGATGA 


5 


ACK1 


NM 005781 


8922074 


10188 


D-0031 02-06 


GCAAGTCGTGGATGAGTAA 


5 


ACK1 


NM 005781 


8922074 


10188 


D-0031 02-07 


GAGCACTACCTCAGAATGA 


5 


ACK1 • 


NM 005781 


8922074 


10188 


D-0031 02-08 


TCAGCAGCACCCACTATTA 


6 


ALK 














ALK 


NM 004304 


29029631 


238 


D-0031 03-05 


GACAAGATCCTGCAGAATA 


6 


ALK 


NM 004304 


29029631 


238 


D-0031 03-06 


GGAAGAGTCTGGCAGTTGA 


6 


ALK 


NM 004304 


29029631 


238 


D-0031 03-07 


GCACGTGGCTCGGGACATT 


6 


ALK 


NM 004304 


29029631 


238 


D-0031 03-08 


GAACTGCAGTGAAGGAACA 


6 


AXL 










* 




AXL 


NM 021913 


21536465 


558 


D-0031 04-05 


GGTCAGAGC TGGAGGA I I I 


6 


AXL 


NM 021913 


21536465 


558 


D-0031 04-06 


GAAAGAAGGAGACCCGTTA 


6 


AXL 


NM 021913 


21536465 


558 


D-0031 04-07 


CCAAGAAGATCTACAATGG 


6 


AXL 


NM 021913 


21536465 


558 


D-0031 04-08 


GGAACTGCATGCTGAATGA 


6 


BLK 














BLK 


NM 001715 


4502412 


640 


D-0031 05-05 


GAGGATGCG I GC I GGA I I I 


6 


BLK 


NM 001715 


4502412 


640 


D-0031 05-06 


ACATGAAGGTGGCCATTAA 


6 


BLK 


NM 001715 


4502412 


640 


D-0031 05-07 


GGTCAGCGCCCAAGACAAG 


6 


BLK 


NM 001715 


4502412 


640 


D-0031 05-08 


GAAACTCGGGTCTGGACAA 


6 


BMX 














BMX 


NM 001721 


21359831 


660 


D-0031 06-05 


AAACAAACCTTTCCTACTA 


6 


-BMX 


NM 001721 


21359831 


660 


D-0031 06-06 


GAAGGAGCATTTATGGTTA— 


-6 


BMX 


NM 001721 


21359831 


660 


D-0031 06-07 


GAGAAGAGATTACCTTGTT 


6 


BMX 


NM 001721 


21359831 


660 


D-0031 06-08 


GTAAGGCTGTGAATGATAA 


6 


BTK 














BTK 


NM 000061 


4557376 


695 


D-0031 07-05 


G AAC AG G AATG G AAG CTTA 


6 


BTK 


NM 000061 


4557376 


695 


D-0031 07-06 


GCTATGGGCTGCCAAATTT 


6 


BTK 


NM 000061 


4557376 


695 


D-0031 07-07 


GAAAGCAACTTACCATGGT 


6 


BTK 


NM 000061 


4557376 


695 


D-0031 07-08 


GGTAAACGATCAAGGAGTT 


6: 


C20orf64 














C20orf64 


NM 033550 


1 9923655 


11285 


D-0031 08-05 


CAACTTAGCCAAGACAATT 


6. 


C20orf64 


NM 033550 


1 9923655 


11285 


D-0031 08-06 


G AAATTG AAG GCTCAGTGA 


6. 


C20orf64 


NM 033550 


19923655 


11285 


D-0031 08-07 


TGGAACAGCTGAACATTGT 


6. 


C20orf64 


NM 033550 


1 9923655 


11285 


D-0031 08-08 


GCTTCCAACTGCTTATATA 


6 


CSF1R 














CSF1R 


NM 005211 


27262658 


1436 


D-0031 09-05 


GGAGAGCTCTGACGTTTGA 


6 


CSF1R 


NM 005211 


27262658 


1436 


D-0031 09-06 


CAACAACGCTACCTTCCAA 


6 


CSF1R 


NM 005211 


27262658 


1436 


D-0031 09-07 


CCACGCAGCTGCCTTACAA 


6 


CSF1R 


NM 005211 


27262658 


1436 


D-0031 09-08 


GGAACAACCTGCAGTTTGG 


6 


CSK 
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INIVI UU4000 


/17^P.fi77 
4 / OOU / / 


1 440 


P\ A AO 4 HA AC 

U-UUol 10-05 


OAGAATGTATTGCCAAGTA 


6 




INIVl UutOOO 


47^ft077 
4 ( OOU / / 


1 440 


Pi A AO 4 4 A AC 

U-U031 10-06 


G AAC AAAGTCG CCGTC AAG 


6 


CSK 


mm nndw* 


47^Afi77 
4 / OOU f / 


1 /MR 
1 440 


P\ A AO A A A A~7 

U-UUol 10-07 


GCGAGTGCCTTATCCAAGA 


6 




mm nn/i^ft^ 

IMIVl UU4OO0 


HfOoV f I 


•1 A A C 

144o 


r~\ AAO-4 HA AO 

U-0031 10-08 


GGAGAAGGGCTACAAGATG 


6 
















UUf \ 1 


MM ni^QQ/l 

iNivi u i oyy4 


f ooy4o4 




r\ A AO A A A AIT 

U-0031 11-05 


GGAGATGGAG 1 1 1 GAGTTT 


6 


nnRi 

UUix I 


MM fM^QQyl 

inivi u i oyy4 


/ OOy 4o4 


7on 


r~\ OAO AAA r\t~* 

D-0031 11-06 


CAGAGGCCCTGTCATCTTT 


6 




MM rH^QQ/l 

inivi un oyy4 


70CQj1 OA 

f OOy4o4 


/oU 


n\ A A O H A A A*7 

D-0031 11-07 


G CTG GT AG CTGTC AAG ATC 


6 


DDR1 


NM 013994 


7669484 


780 


D-0031 11-08 


TGAAAGAGGTGAAGATCAT 


6 


DDR2 














DDR2 


NM 006182 


545381 3 


4921 


D-0031 12-05 


GGTAAGAACTACACAATCA 


6 


DDR2 


NM 006182 


5453813 


4921 


D-0031 12-06 


GAACGAGAGTfiPrACrAAT 


U 


DDR2 


NM 006182 


545381 3 


4921 


D-0031 12-07 


ACACCAATCTGAAGTTTAT 


6 


DDR2 


NM 006182 


5453813 


4921 


D-0031 12-08 


CAACAAGAATGCCAGGAAT 


6 


DKFZp761 
P1010 














DKFZp761 
P1010 


NM_0 18423 


8922178 


55359 


D-0031 13-05 


CCTAGAAGCTGCCATTAAA 


6 


DKFZp761 
P1010 


NM_018423 

• 


8922178 


55359 


D-0031 13-06 


GATTAGGCCTGGCTTATGA 


6 


DKFZp761 
P1010 


NM 018423 


89221 78 


55359 


D-0031 13-07 




O 


P1010 


MM 01 A/19 0 * 


AOOO-I 7Q 

oy i / o 


ooooy 


T~\ AAOH HO AO 

U-OU01 1o-0o 


ppTPHT A PAT^ A A A7*AT AT" 

GGTG GTACCTG AACTGTAT 


6 


















NM 005228 


48851 98 


1956 


D-0031 14-05 


GAAGGAAACTGAATTCAAA 


6 


tbrK 


NM 005228 


A C\C\ TT A OO 

4885198 


1956 


D-0031 14-06 


GGAAATATGTACTACGAAA 


6 


rprn 

bbrK 


NM 005228 


4885198 


1956 


D-0031 14-07 


CCACAAAGCAGTGAATTTA 


6 


torK 


NM 005228 


A oo r A /\n 

48851 98 


1956 


D-0031 14-08 


GTAACAAGCTCACGCAGTT 


6 


trnAI 














trnAI 


NM 005232 


A oo cor\ o 

4885208 


2041 


D-0031 1 5-05 


GACCAGAGCTTCACCATTC 


6 


rnuA -1 

trnAI 


NM 005232 


A O O C?\f\ O 

4885208 


2041 


D-0031 15-06 


GCAAGACTGTGGCCATTAA 


6 


trnAI 


INM UU52o2 


4885208 


2041 


D-00311 5-07 


GGGCGAACCTGACCTATGA 


6 


COUA1 

trnAI 


miui nncooo 
NM U0ozo2 


4885208 


Aft >| ^4 

2041 


D-0031 1 5-08 


GATTGTAGCCGTCATC 1 1 1 


6 


trnAz 














trnA<£ 


kit a aa J yt o 4 

NM 004431 


4758277 


1969 


D-0031. 1 6-05 


GGAGGGATCTGGCAACTTG 


6 




NM 004431 


4758277 


1969 


D-0031 16-06 


GCAGCAAGGTGCACGAATT 


6 


t-rriAz 


NM 004431 


4758277 


1969 


D-0031 16-07 


GGAGAAGGATGGCGAGTTC* 


~6 


trrlAii 


NM 004431 


4758277 


1969 


D-0031 16-08 


GAAGTTCACTACCGAGATC 


6 


trnAo 














CDLIAO 

trUAO 


Kilt A nrtcooo 
NM UUo2o3 


21 361240 


2042 


D-0031 1 7-05 


GATCGGACCTCCAGAAATA 


6 


CrrlAO 


Klltfl Ancooo 
NM U0o2oo 


21361240 


2042 


D-0031 17-06 


G AACTC AG CTC AG AAG ATT 


6 


CDUAO 


InM UUOzoo 




2042 


D-0031 17-07 


GCAAGAGGCACAAATGTTA 


6 


CpUAO 
CrnnO 


MM aaaoiq 

INM UUO^OO 


obi 240 


2042 


r*v AAOH^I-7 AO 

D-0031 17-08 


A y^*t A A i'^^ A AAA A 

G AG C ATCAGTTTACAAAG A 


6 


PPHA/1 
Lrrln't 














CDU AA 

trnA4 


MM rxnAAVQ 
IMIVl UU44oo 


4/oo279 


2043 


D-0031 18-05 


GGTCTGGGATGAAGTA 1 1 1 


6 


PPWAzl 

urrlnH 


mm nfM/iQQ 

INIVI UU440O 


/17CQ07n 

4/ooz/y 


OA A O 

2043 


I~\ AAOH A O /"i O 

D-0031 18-06 


GAATGAAGTTACCTTATTG 


6 


PPHA4 


INIVI UU4400 


4f Oo2/» 


2043 


D-0031 18-07 


GAACTTGGGTGGATAGCAA 


6 




MM nfi/i/iQQ 
INIVI UU440O 


4/oo2f 9 


OA A O 

2043 


|— \ AAOd H O AO 

D-0031 18-08. 


*/^V A A | I AAA ■ r a y\ - ■ a ■ ^ a 

GAGATTAAATTCACCTTGA 


6 


EPHA7 














EPHA7 


NM 004440 


4758281 


2045 


D-0031 19-05 


GAAAAGAGATGTTGCAGTA 


6 


EPHA7 


NM 004440 


4758281 


2045 


D-0031 19-06 


CTAGATGCCTCCTGTATTA 


6 


EPHA7 


NM 004440 


4758281 


2045 


D-0031 19-07 


AGAAGAAGGTTATCGTTTA 


6 


EPHA7 


NMJD04440 


4758281 


2045 


D-0031 19-08 


TAGCAAAGCTGACCAAGAA 


6 


EPHA8 














EPHA8 


NM 020526 


18201903 


2046 


D-0031 20-05 


GAAGATGCACTATCAGAAT 


6 
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EPHA8 


NM 020526 


18201903 


2046 


D-0031 20-06 


GAGAAGATGCACTATCAGA 




EPHA8 


NM 020526 


a a -A a^ n n 

18201903 


2046 


D-0031 20-07 


AACCTGATCTCCAGTGTGA 




t~ i—\i i a n 

EPHA8 


NM 020526 


18201903 


2046 


D-0031 20-08 


BBB ^Bh-BBBBh ^Bk. M. A A ^Bi A .^B-, MV ^hBf BtBk 

TCTCAGACCTGGGCTATGT 




f~ f^i i r\ At 

EPHB1 














EPHB1 


NM 004441 


21 396502 


2047 


D-0031 21 -05 


GCGATAAGCTCCAGCATTA 




EPHB1 


NM 004441 


21396502 


2047 


D-0031 21 -06 


GAAACGGGCTTATAGCAAA 




EPHB1 


L It ■ jBt A A A A 

NM 004441 


21 396502 


2047 


D-0031 21 -07 


GGATGAAGATCTACATTGA 




EPHB1 


k, ( 1 A ^B. A A A A 

NM 004441 


21 396502 


2047 


D-0031 21-08 


G C ACGTCTCTGTC AAC ATC 




EPHB2 














EPHB2 


NM 017449 


1 7975764 


2048 


D-0031 22-05 


ACTATGAGCTGCAGTACTA 




BBBB BBW | | 

EPHB2 


NM 017449 


1 7975764 


2048 


D-0031 22-06 


GTACAACGCCACAGCCATA 




EPHB2 


NM 017449 


1 7975764 


2048 


D-0031 22-07 


GGAAAGCAATGACTGTTCT 




EPHB2 


NM 017449 


1 7975764 


2048 


D-0031 22-08 


CGGACAAGCTGCAACACTA 




EPHB3 














EPHB3 


NM 004443 


17975767 


2049 


D-0031 23-05 


GGTGTGATCTCCAATGTGA 




EPHB3 


NM 004443 


1 7975767 


2049 


D-0031 23-06 


GGGATGACCTCCTGTACAA 




EPHB3 


NM 004443 


1 7975767 


2049 


D-0031 23-07 


CAGAAGACCTGCTCCGTAT 




EPHB3 


NM 004443 


1 7975767 


2049 


D-0031 23-08 


GAGATGAAGTACTTTGAGA 




EPHB4 














■B*4 BIBBk ■ BBBk J 

EPHB4 


NM 004444 


1 7975769 


2050 


D-0031 24-05 


GGACAAACACGGACAGTAT 




EPHB4 


NM 004444 


1 7975769 


2050 


D-0031 24-06 


GTACTAAGGTCTACATCGA 




EPHB4 


NM 004444 


1 7975769 


2050 


D-0031 24-07 


GGAGAGAAGCAGAATATTC 




EPHB4 


NM 004444 


1 7975769 


2050 


D-0031 24-08 


GCCAATAGCCACTCTAACA 




EPHB6 














EPHB6 


NM 004445 


4758291 


2051 


D-0031 25-05 


GGAAGTCGATCCTGCTTAT 




EPHB6 


NM 004445 


4758291 


2051 
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U-UUO I oo-uo 


GAGA 1 1 1 CCTGGACTAGAA 


8 


NTRK1 

■ * ■ ■ xi x ■ 














NTRK1 


NM 00252Q 


458571 1 


4Q1 A 


u-uuo i oy-uo 


GGAGAACCC 1 1 1 CGAGTTC 


8 


NTRK1 

I ^ I I XI X 1 


NM 00252Q 

INIVI VVJ^vJ^CCy 


45857*1 1 


AQ1 A 


u-uuo i oy-uo 


OGAGTGACCTCAACAGGAA 


8 


NTRK1 


NM 00252Q 

INIVI \J\J t-\J C~\3 






U-UUol oy-u/ 


GGACAATACTTCAGTGATG 


8 


NTRK1 

1 ^ ■ 1 Xt X I 


NM 00252Q 

InIVI \J\J C-\J C~\J 




4Q1 A 


U-UUol oy-uo 


GAAGAGTGGTCTCCGTTTC 


8 


NTRK2 














NTRK2 


INIVI UUU | ou 


I Ou I OUO 


i4Q1 K 
4» I 0 


U-UUol oU-Uo 


GAACAGAAGTAATGAAATC 


8 


NTRK2 


MM nnRiftn 

INIVI \JKJ\J I OU 


9*1 *kC\R 
1 Ou 1 jUO 




U-UUol oU-Uo 


GTAATGCTGTTTCTGCTTA 


8 


NTRK2 


MM Dfifilftn 

IMIV1 UUO IOU 




/lQ^ c 

4yi 0 


U-UUol o0-07 


G C AAG ACACTCC AAGTTTG 


8 


NTRK2 


MM ftORIftO 


^ I OO I OUD 


4y1 o 


U-UUoloO-Oo 


GAAAGTCTATCACATTATC 


8 


NTRK3 

Mill \J \\J 














NTRK3 




*fOUO £ f / *f 


4yi d 


U-U0o1o1-05 


GAGCGAATCTGCTAGTGAA 


8 


NTRK3 


mm f.n9^^n 


4^n*\A7/l 


4yi o 


U-UUol Ol-Oo 


A A " | 1 ■ ^ /™v A y~\'T~ a a ✓"v a ^x a ^x 

GAAGTTCACTACAGAGAGT 


8 


NTRK3 


MM 009^.30. 

INIVI UU^-OOU 




4y i o 


U-UUol ol-U/ 


GGTCGACGGTCCAAA IMG 


8 


NTRK3 

1 « 1 1 \l \v 


MM 002530 

INIVI UU£JOU 




4y i o 


U-UUol Ol-Uo 


GAATATCACTTCCATACAC 


8 


PDGFRA 














PDGFRA 


MM 00R90R 

INIVI UUUiiUU 


I O^f O \ ( Of 


OTOO 


U-UUol O^i-Uo 


GAAACTTCCTGGACTA 1 1 1 


8 


PDGFRA 


NM 00R2OR 

INIVI \J \J\J/L.\J\J 


I OnO I f Of 


R1 f^ft 
0 I DO 


U-UUol OZ-Uo 


GAGA l-l^.l GG 1 CAAG IAI 1 1 


8 


PDGFRA 

1 t-^ W 1 1 X/ \ 


MM 006206 

INIVI \J\J\J£.\J\J 


1 ^4^1 7ft7 
I u*+o \ f Of 


O I OO 


U-UUol 0^!-U/ 


GOACGCCGCu CCTGATAT 


8 


PDGFRA 


NM 00R20R 


1 7P7 
I u**o \ t Of 


0 I DO 


n nnnco ad 
U-UUol O^-Uo 


CATCAGAGCTGGATCTAGA 


8 


PDGFRB 














PDGFRB 


MM 002R0Q 

INIVI UU^UUv 


1 ^4^1 78ft 
I I / OO 


t\1 

o ioy 


P» AAQHCO AC 

U-UUol oo-Uo 


AAA A A ^*^^*>T*/*N AAA *^ A *T" 

GAAAGGAGACGTCAAATAT 


8 


PDGFRB 


NM 0/12R0Q 

INIVI \JXJ £-\J\J\J 


1 ^4^1 7Pfl 


^1 

o i oy 


Pi AA'3^f;'3 nc 
U-UUol OO-UO 


G G AATGAGGTGGTCAACTT 


8 


PDGFRB 


MM '009R0Q 
inivi uu**_uuy 


I OHO I / OO 


oi oy 


U-UUol DO-U7 


CAACGAGTCTCCAGTGCTA 


8 


PDGFRB 


mm 002R0Q 

INIVI Uu&.UUv7 


I OHO I / OO 


£1 

o i oy 


p* nno'ici ao 
U-UUol Oo-Uo 


GAGAGGACCTGCCGAGCAA 


8 


PTK2 














PTK2 

■ II X 


NM 005R07 


97QOCCQO 
c-i OOOQi?zC 


^747 
OfHf 


P* ft/Ti'I C>1 AC 

U-UUO 1 04-Uo 


GAAGTTGGG 1 1 GTCTAGAA 


8 


PTK2 


NM 005R07 

INIVI UU JUUf 




Of Hi 


p* nn^dGyi ac 
U-UUo 1 04-UO 


/~\ A A A A A A A T /^^P A A 

GAAGAACAATGATGTAATC 


8 


PTK2 


NM 005607 


978AR C 1Q9 


R747 

O f H f 


n nn^iR^i r»7 
u-uuo 1 04-u ( 


f** A A AT'I /^*/^T"l" 1 A A /~* 1 ' > 

GGAAA I TGO 1 1 1 GAAGTTG 


8 


PTK2 


NM 005607 


2788R5Q9 


5747 


n_nn3iR4 oa 

u-uuo I oh- uo 


PPTTP A A r^T ^ ATT ATT — T 

bbl 1 UAAGO 1 GGA 1 TATTT 


8 


PTK2B 














PTK2B 


NM 004103 

■ "J * » ■ Xj# | 1 XJ X^ 


27886583 


2185 


u UUO I UJ"UO 


n A APATPPPTP A r^r^-rr* AT A 
bAAbA 1 bbO 1 GAuO 1 OA I A 


8 


PTK2B 


NM 004103 


9788R^83 


918*1 

Z. 1 OO 


H-nn^iR^ or 

U-UUO I OD-UO 


GGAGOAOGOTGGTCTAI 1 1 


8 


PTK2B 


NM 004103 

1 NJ 1 ■ ■ xy >v l i x/xJ 


9788RS83 


C. 1 OO 


n_nn3iR^ 07 

u-uuo I OO-U / 


f^r* A /"^O A O A /->T" A TT A AAA 

GGAGGAGGACTATTACAAA 


8 


PTK2B , 


NM 004103 


27886583 


2185 


D-0031 65-08 


TGGCAGAGGTCATCAACAA 


ft 


PTK6 














PTK6 


NM 005975 


27886594 


5753 


D-0031 66-05 


GAGAAAGTCC 1 GCCCG 1 1 1 


8 


PTK6 


NM 005975 


27886594 


5753 


D-0031 66-06 


TGAAGAAGCTGCGGCACAA 


8 


PTK6 


NM 005975 


27886594 


5753 


D-0031 66-07 


CCGCGACTCTGATGAGAAA 


8 


PTK6 


NM 005975 


27886594 


5753 


D-0031 66-08 


TGCCCGAGCTTGTGAACTA 


8 


PTK7 
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PTK7 


NM 002821 


27886610 


m~ mmm mmi A 

5754 


D-0031 67-05 


j^m* ft ft *mm± m m —mm. jm^_ jm^ ^m*. m ft M\ m 

GAGAGAAGCCCACTATTAA 


8 


PTK7 


NM 002821 


27886610 


5754 


D-0031 67-06 


CGAGAGAAGCCCACTATTA 


8 


PTK7 


NM 002821 


27886610 


5754 


D-0031 67-07 


GGAGGGAGTTGGAGATGTT 


8 


PTK7 


NM 002821 


27886610 


5754 


D-0031 67-08 


GAAGACA I GCCGCTAT I !"G 


8 


PTK9 












PTK9 


NM 002822 


4506274 


5756 


D-0031 68-05 


GAAGAACTACGACAGATTA 


8 


PTK9 


NM 002822 


4506274 


5756 


D-0031 68-09 


GAAGGAGACTATTTAGAGT 


8 


PTK9 


NM 002822 


4506274 


5756 


D-0031 68-10 


GAGCGGATGCTGTATTCTA 


8 


PTK9 


NM 002822 


4506274 


5756 


D-0031 68-11 


CTGCAGACTTCCTTTATGA 


8 


PTK9L 








- 






PTK9L 


NM 007284 


31 543446 


11344 


D-0031 69-05 


AGAGAGAGCTCCAGCAGAT 


8 


PTK9L 


NM 007284 


31 543446 


11344 


D-0031 69-06 


TTAACGAGGTGAAGACAGA 


8 


PTK9L 


ft. I ft M J A L M» J M L « 

NM 007284 


31543446 


11344 


D-0031 69-07 


ACACAGAGCCCACGGATGT 


8 


PTK9L 


NM 007284 


31 543446 


11344 


D-0031 69-08 


GCTGGGATCAGGACTATGA 


8 


RET 














RET 


NM 000323 


21536316 


5979 


D-0031 70-05 


GCAAAGACCTGGAGAAGAT 


8 


RET 


NM 000323 


21536316 


5979 


D-0031 70-06 


GCACACGGCTGCATGAGAA 


8 


RET 


NM 000323 


21536316 


5979 


D-0031 70-07 


GAACTGGCCTGGAGAGAGT 


8 


RET 


NM 000323 


21536316 


5979 


D-0031 70-08 


TTAAATGGATGGCAATTGA 


8 


ROR1 














ROR1 


NM 005012 


4826867 


4919 


D-0031 71 -05 


GCAAGCA I C I I I AC I AGGA 


8 


ROR1 


NM 005012 


4826867 


4919 


D-0031 71 -06 


GAGCAAGGCTAAAGAGCTA 


8 


ROR1 


NM 005012 


4826867 


4919 


D-0031 71 -07 


GAGAGCAACTTCATGTAAA 


8 


ROR1 


NM 005012 


4826867 


4919 


D-0031 71 -08 


GAGAATGTCCTGTGTCAAA 


8 


ROR2 






• 








ROR2 


NM 004560 


1 9743897 


4920 


D-0031 72-05 


GGAACTCGCTGCTGCCTAT 


£ 


ROR2 


NM 004560 


1 9743897 


4920 


D-0031 72-06 


G C AGGTGCCTCCTCAG ATG 


£ 


ROR2 


NM 004560 


1 9743897 


4920 


D-0031 72-07 


GCAATGTGCTAGTGTACGA 


8 


ROR2 


NM 004560 


19743897 


4920 


D-0031 72-08 


GAAGACAGAATATGGTTCA 


€ 


ROS1 














ROS1 


NM 002944 


19924164 


6098 


D-0031 73-05 


GAGGAGACCTTCTTACTTA 


8 


ROS1 


NM 002944 


19924164 


6098 


D-0031 73-06 


TTACAGAGGTTCAGGATTA 


8 


ROS1 


NM 002944 


19924164 


6098 


D-0031 73-07 


GAACAAAC0TAAGCATGAA 


8 


•■■v ^ 

ROS1 


NM 002944 


19924164 


6098 


D-0031 73-08 


GAAAGAGCACTTCAAATAA 


£ 


RYK 


• 






, — , — 






RYK 


NM 002958 


11863158 


6259 


D-0031 74-05 


GAAAGATGGTTACCGAATA 




RYK 


NM 002958 


11863158 


6259 


D-0031 74-06 


CAAAGTAGATTCTGAAGTT 




RYK 


NM . 002958L. 


11863158 


6259 


D-0031 74-07 


TCACTACG CTCTATCCTTT 




RYK 


NM 002958* 


11863158 


6259 


D-0031 74-08 


GGTGAAGGATATAGCAATA 




SRC 














SRC 


NM 005417 


21361210 


6714 


D-0031 75-05 


GAGAACCTGGTGTGCAAAG 




SRC 


NM 005417 


21361210 


6714 


D-0031 75-09 


GAGAGAACCTGGTGTGCAA 




SRC 


NM 005417 


21361210 


6714 


D-0031 75-10 


GGAGTTTGCTGGACTTTCT 




SRC 


NM 005417 


21361210 


6714 


D-0031 75-1 1 


GAAAGTGAGACCACGAAAG 




SYK 














SYK 


ft ft ft ft #^ ^ «w w 

NM 003177 


jflfe A M mam mm* m^mm. 

21361552 


6850 


D-0031 76-05 


GGAATAATCTCAAGAATCA 




SYK 


ft k ft ft ^ 

NM 003177 


21361552 


6850 


D-0031 76-06 


GAACTGGGCTCTGGTAATT 




SYK 


NM 003177 


21361552 


6850 


D-0031 76-07 


GGAAGAATCTGAGCAAATT 




SYK 


ft. 1 ft Jl ^> -| ^WWW 

NM 003177 


^ «^ J mm*, ^m* J*^. 

21361552 


6850 


D-0031 76-08 


GAACAGACATGTCAAGGAT 




TCP 














TEC 


NM 003215 


4507428 


7006 


D-0031 77-05 


GAAATTGTCTAGTAAGTGA 




TEC 


NM 003215 


4507428 


7006 


D-0031 77-06 


CACCIGAAGIGI I I AAI I A 




TEC 


NM 003215 


4507428 


7006 


D-0031 77-07 


GTACAAAGTCGCAATCAAA 




TEC 


NM 003215 


4507428 


7006 


D-0031 77-08 


TGGAGGAGATTCTTATTAA 




TEK 














TEK 


NM 000459 


4557868 


7010 


D-0031 78-05 


GAAAGAATATGCCTCCAAA 
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TEK 


NM 000459 


4557868 


7010 


D-0031 78-06 


GGAATGACATCAAATTTCA 


8 


TEK 


NM 000459 


4557868 


7010 


D-0031 78-07 


TGAAGTACCTGATATTCTA 


8 


TEK 


NM 000459 


4557868 


7010 


D-0031 78-08 


CGAAAGACCTACGTGAATA 


8 


TIE 














TIE 


NM 005424 


4885630 


7075 


D-0031 79-05 


GAGAGGAGGTTTATGTGAA 


8 


TIE 


NM 005424 


4885630 


7075 


D-0031 79-06 


GGGACAGCCTCTACCCTTA 


8 


TIE 


NM 005424 

* WWW* XX XX XX V • 


4885630 

• xx xx XX xx xx xx 


7075 


D-0031 79-07 

X^ XX XX t • XX v ' 


GAAGTTCTGTGCAAATTGG 

Xi/ / w * NX I I X^ ^ I X.X | X_X X_X § W V » ■ ■ ^""^ 


8 


TIE 


NM 005424 

■ " » • ► XX XX V* 1 1 


4885630 

• XpF xx- X* xx xx XX 


7075 

• w • xx 


D-0031 79-08 

■■P*^ X^ X» X» 1 ■ V X* XX 


CAACATGGCCTCAGAACTG 


g 


TNK1 














TNK1 


NM 003985 

I ~ ■ w I XX XX XX XX X^ X^ 


4507610 

■ XjXXX ■ xx ■ xx 


8711 


D-0031 80-05 

»^ XX XX XX l XX XX XV XX 


GTTCTGGGCCTAAGTCTAA 

X**' 1 1 X.X 1 X^ V X.X x_x | / U \ Xt* 1 X# 1 # W 1 


g 

XX 


TNK1 


NM 003985 

I 1 J ■ ■ XX X* xx xx xx xx 


4507610 

f XX X# f XX I w 


8711 

XX I | f 


D-0031 80-06 

W*^r XX V XX I XX XX XX \/ 


GAACTGGGTCTACAAGATC 

x^X^ f \# l x^^ x^^ | x^X | / \ X^r # V »x«^# 1 ■ 


g 

XX 


TNK1 


NM 003985 

■ V ■ * ■ XX »X XX Vjr X^r XX 


4507610 

* XX V 1 xx ■ xx 


8711 


D-0031 80-07 

W XX X^ XX 1 w U XX J 


CGAGAGGTATCGGTCATGA 


0 


TNK1 


NM 003985 

■ t ■ ■ f XX XX XX XX Xx XX 


4507610 

IVV f XX I xx 


8711 

xx v t * 


D-0031 80-08 

k>X XX XX XX 1 X^^X X^ XX 


GGCGCATCCTGGAGCATTA 


0 

XX 


TXK 














TXK 

■ x x ■ x 


NM 003328 

■ V I V ■ XX XX XX xx xv 


4507742 


7294 


D-0031 81 -05 

XX VX XX 1 XX 1 XX XX 


GAACATCTATTGAGACAAG 


C 

x>< 


TXK 

1 x X I X 


NM 003328 


4507742 


7294 

f XX I 


D-0031 81 -06 

Lx X> XX XX 1 \mf 1 Wx 


1 CAAGGCAC i 1 1 A 1 GA 1 1 1 

J \y#U V\x x^J x«// VVx III / \ 1 v/i III 


c 


TXK 

1 x xl A 


NM 003328 


4507742 


7294 


D-0031 81 -07 


GGAGAGGAATGGCTATATT 


c 

x> 


TXK 


NM 003328 

1 11 II XX XX XX XX &m X»* 


4507742 

■ XX XX f f Y A— 


7294 

I ^^XX t 


D-0031 81 -08 

XX XX XX J Vx I XX \J 


GGATATATGTGAAGGAATG 


c 

x^ 


TYK2 














TYK2 

III X«w 


NM 003331 


4507748 


7297 


D-0031 82-05 


GAGGAGATCCACCACTTTA 




TYK2 


NM 003331 


4507748 


7297 


D-0031 82-06 


GCATCCACATTGCACATAA 




TYK2 


NM 003331 


4507748 


7297 


D-0031 82-07 


TCAAATACCTAGCCACACT 




TYK2 

III \£- 


NM 003331 


4507748 


7297 


D-0031 82-08 


CAATCTTGCTGACGTCTTG 




TYR03 














TYR03 


NM 006293 

l^ilVI UUU£.t^v 


275Q7077 


7^01 




1 AC-5AA(-5^ 1 G l GCCA 1 1 1 

VJVJ i r\vjr\r\\jV3 i \j i i i i 




TYROV 

111 wu 


NM 0062Q3 

1NIVI \J\JKJ L-Z7\J 




7^01 


D-0031 8^-06 


ACOCTC5Af5ATTTACAACTA 




TYR03 

% 1 I X x^x XX 


NM 006293 


27597077 


7301 

r xx^x** v 


D-0031 83-07 

t-^ XX V XX 1 \xV X-r 1 


GGATGGCTCCTTTGTGAAA 

x_ X/ \ | X_^ x_x t ^»x III x^x I x_ X/ W U X 




TYR03 


NM 006293 


27597077 


7301 


D-0031 83-08 


GAGAGGAACTACGAAGATC 




YES1 














YES1 


NM 005433 


21071041 


7525 


D-0031 84-05 


GAAGGACCCTGATGAAAGA 




YES1 


NM 005433 


21071041 


7525 


D-0031 84-06 


TAAGAAGGGTGAAAGATTT 




YES1 


NM 005433 


21071041 


7525 


D-0031 84-07 


TCAAGAAGCTCAGATAATG 




YES1 


NM 005433 


21071041 


7525 


D-0031 84-08 


CAGAATCCCTGCATGAATT 


V 



Table VIII 



Gene 
Name 


Acc# 




Locus 
Link 


Duplex # 


Full Sequence 


SEQ. I[ 
NO. 


APC2 














APC2 


NM 013366 


7549800 


29882 


D-003200-05 


GCAAGGACCTCTTCATCAA 


921 


APC2 


NM 013366 


7549800 


29882 


D-003200-06 


GAGAAGAAGTCCACACTAT 


922 


APC2 


NM 013366 


7549800 


29882 


D-003200-07 


GGAATGCCATCTCCCAATG 


923 


APC2 


NM 013366 


7549800 


29882 


D-003200-09 


CAACACGTGTGACATCATC 


924 


ATM 














ATM 


NM 000051 


20336202 


472 


D-003201-05 


GCAAGCAGCTGAAACAAAT 


925 


ATM 


NM 000051 


20336202 


472 


D-003201-06 


GAATGTTGCTTTCTGAATT 


926 


ATM 


NM 000051 


20336202 


472 


D-003201-07 


GACCIGAAGICI IAI I IAA 


927 


ATM 


NM 000051 


20336202 


472 


D-003201-08 


AGACAGAATTCCCAAATAA 


928 


ATR 














ATR 


NM 001184 


20143978 


545 


D-003202-05 


GAACAACACTGCTGGTTTG 


929 


ATR 


NM 001184 


20143978 


545 


D-003202-06 


GAAGTCATCTGTTCATTAT 


930 


ATR 


NM 001184 


201 43978 


545 


D-003202-07 


GAAATAAGGTAGACTCAAT 


931 


ATR 


NM .001184 


20143978 


545 


D-003202-08 


CAACATAAATCCAAGAAGA 


932 


BTAK 
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D 1 Ar\ 


IMM UUodUU 


o21o19d 


6790 


D-003545-04 


CAAAG AATC AGCTAG CAAA 


933 


DTAI/ 

D 1 Aft, 


NM UUooUU 


3213196 


6790 


D-003203-05 


GAAGAGAGTTATTCATAGA 


934 


D 1 Ar\ 


InIVI UUodUU 


o21 oiyb 


r\ ~7C\r\ 

6790 


D-003203-07 


CAAATGCCCTGTCTTACTG 


^X >x 

935 


O 1 ISO 


NM UUodUU 


3213196 


6790 


D-003203-09 


TCTCGTGACTCAGCAAATT 


936 
















PPM a 4 


NM UU3914 


1 6306528 


8900 


D-003204-05 


GAACCTGGCTAAGTACGTA 


937 


OONA1 


NM 003914 


1 6306528 


8900 


D-003204-06 


GCAGATCCATTCTTGAAAT 


938 


PPM A -i 

OONA1 


NM 003914 


1 6306528 


A /*\ A A 

8900 


D-003204-07 


TCACAAGAATCAGGTGTTA 


939 


PPM A 4 


NM 003914 


1 6306528 


8900 


D-003204-08 


CATAAAGCGTACCTTGATA 


940 
















PPM A O 

OUNA2 


NM 001237 


1 6950653 


f\ A A 

890 


D-003205-05 


GCTGTGAACTACATTGATA 


941 


PPM A O 

OONA2 


NM 001237 


1 6950653 


A A A 

890 


D-003205-06 


GATGATACCTACACCMGA 


942 


PPM A O 

OUNA2 


NM 001237 


1 6950653 


n r\ r\ 

890 


D-003205-07 


GCTGTTAGCCTCAAAGTTT 


943 


PPM A O 

UUNA2 


NM 001237 


1 6950653 


f\ f\ A 

890 


D-003205-08 


AAGCTGGCCTGAATCATTA 


944 


PPMD ■i 














PPMD4 


KIKJI /\n/*ir*r* 

NM 031966 


14327895 


891 


D-003206-05 


CAACATTACCTGTCATATA 


945 


PPMD 4 


NM 031966 


14327895 


891 


D-003206-06 


CCAAATACCTGATGGAACT 


946 • 


PPMD A 


NM 031966 


14327895 


891 


D-003206-07 


GAAATGTACCCTCCAGAAA 


947 


PPMD A 

UUNB1 


NM 031966 


14327895 


A A ^ 

891 


D-003206-08 


G C AC CTGG CTAAG AATGT A 


948 


GCNB2 














PPMr>o 

CCNB2 


NM 004701 


10938017 


9133 


D-003207-05 


CAACAAATGTCAACAAACA 


949 


CCNB2 


ft 1 ft Jt f\. /*! jt — j A J 

NM 004701 


10938017 


9133 


D-003207-06 


GCAGCAAACTCCTGAAGAT 


950 


CCNB2 


NM 004701 


10938017 


9133 


D-003207-07 


CCAGTGATTTGGAGAATAT 


951 


CCNB2 


NM 004701 


10938017 


9133 


D-003207-08 


GTGACTACGTTAAGGATAT 


952 


CCNB3 














CCNB3 


k I Wk M ^X ^x M*\. -A 

NM 033031 


1471 9419 


85417 


D-003208-05 


TGAACAMCTGCTGACTTT 


953 


CCNB3 


A. 1 ft. A ^X ^X ^X *^k, a 

NM 033031 


14719419 


85417 


D-003208-06 


GCTAGCTGCTGCCTCCTTA 


954 
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1509 
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PPARD 



PPARD 



NM 006238 



NM 006238 



5453939 



5453939 



D-003435-02 



D-003435-03 



GCATGAAGCTGGAGTACGA 



GGAAGCAGTTGGTGAATGG 



1510 



1511 



PPARD 



NM 006238 



5453939 



D-003435-04 



G CTGCAAG ATTC AG AAG AA 



1512 



PPARG 



PPARG 



PPARG 



PPARG 



RARA 



RARA 



RARA 



RARA 



NM 138712 



20336234 



D-003436-01 



AGACTCAGCTCTACAATAA 



NM 138712 



20336234 



D-003436-02 



GATTGAAGCTTATCTATGA 



NM 138712 



20336234 



D-003436-03 



AAGTAACTCTCCTCAAATA 



NM 138712 



20336234 



D-003436-04 



GCATTTCTACTCCACATTA 



NM 000964 



450641 8 



D-003437-01 



GACAAGAACTGCATCATCA 



NM 000964 



4506418 



D-003437-02 



GCAAATACACTACGAACAA 



NM 000964 



450641 8 



D-003437-03 



GAACAACAGCTCAGAACAA 



NM 000964 



450641 8 



D-003437-04 



GAGCAGCAGTTCTGAAGAG 



1513 



1514 



1515 



1516 



1517 



1518 



1519 



1520 



RARB 



RARB 



RARB 



RARB 



NM 000965 



14916493 



D-0034 38-01 



GCACACTGCTCAATCAATT 



NM 000965 



14916493 



D-003438-02 



NM 000965 



GCAGAAGTATTCAGAAGAA 



14916493 



D-003438-03 



GGAATGACAGGAACAAGAA 



NM 000965 



14916493 



D-003438-04 



GCACAGTCCTAGCATCTCA 



1521 



1522 



1523 



1524 



RARG 



NM 000966 



21359851 



D-003439-01 



GAAATGACCGGAACAAGAA 



1525 



RARG 



RARG 



NM 000966 



21359851 



D-003439-02 



TAGAAGAGCTCATCACCAA 



NM 000966 



21359851 



D-003439-03 



CAAGGAAGCTGTGCGAAAT 



1526 



1527 



RARG 



NM 000966 



21359851 



D-003439-04 



TCAGTGAGCTGGCTACCAA 



1528 



RORA 



NM 134261 



19743902 



D-003440-01 



GGAAAGAGTTTATGTTCTA 



1529 



RORA 



RORA 



RORA 



NM 134261 



19743902 



D-003440-02 



CAAGATCTGTGGAGACAAA 



NM 134261 



19743902 



D-003440-03 



NM 134261 



19743902 



D-003440-04 



GCACCTGACTGAAGATGAA 
CCGAGAAGATGGAATACTA 



1530 



1531 



1532 



RORB 



RORB 



RORB 



RORB 



RORC 



RORC 



RORC 



RORC 



RXRA 



NM 006914 



1 9743906 



D-003441-01 



NM 006914 



GCACAGAACATCATTAAGT 



19743906 



D-003441-02 



NM 006914 



19743906 



NM 006914 



1 9743906 



NM 005060 



1 9743908 



NM 005060 



1 9743908 



NM 005060 



1 9743908 



NM 005060 



1 9743908 



NM 002957 



21536318 



CCACACCTATGAAGAAATT 



D-003441-03 



GATCAAATTCTACTTCTGA 



0^.003441-04 



TCAAAC AG ATAAAG CAAGA 



D-003442-01 



TAGAACAGCTGCAGTACAA 



D-003442-02 



TCACCGAGGCCATTCAGTA 



D-003442-03 



GAACAGCTGCAGTACAATC 



D-003442-04 



CCTCATGCCACCTTGAATA 



D-003443-01 



TGACGGAGCTTGTGTCCAA 



1533 



1534 



1535 



1536 



1537 



1538 



1539 



.1540 



1541 



RXRA 



RXRA 



RXRA 



RXRB 



RXRB 



RXRB 



NM 002957 



21536318 



D-003443-02 



NM 002957 



CAACAAGGACTGCCTGATT 



21536318 



D-003443-03 



GCAAGGACCTGACCTACAC 



NM 002957 



21536318 



D-003443-04 



GCAAGGACCGGAACGAGAA 



NM 021976 



21687229 



D-003444-01 



GCAAAGACCTTACATACTC 



NM 021976 



21687229 



D-003444-02 



NM 021976 



GCAATCATTCTGTTTAATC 



21687229 



D-003444-03 



TCACACCGATCCATTGATG 



1542 



1543 



1544 



1545 



1546 



1547 



RXRB 



NM 021976 



21687229 



D-003444-04 



GCAAACGGCTATGTGCAAT 



1548 



RXRG 



RXRG 



RXRG 



RXRG 



THRA 



THRA 



NM 006917 



21361386 



D-003445-01 



NM 006917 



GGAAGGACCTCATCTACAC 



21361386 



D-003445-02 



NM 006917 



CCGGATCTCTGGTTAAACA 



21361386 



D-003445-03 



NM 006917 



GCGAGCCATTGTACTCTTT 



21361386 



D-003445-04 



GAGCCATTGTACTCTTTAA 



NM 003250 



20127451 



D-003446-01 



NM 003250 



GGACAAAGACGAG CAGTGT 



20127451 



D-003446-02 



GGAAACAGAGGCGGAAATT 



1549 



1550 



1551 




1553 



1554 



WO 2004/045543 



PCT/US2003/036787 



129 



Tun a 

1 HKA 


InM UUoZoO 


Ort A 17 AC A 


D-003446-03 


GTAAGCTGATTGAGCAGAA 


1555 


TUDA 


ma/i nniocn 
NM UUo^oU 


OZ"M 1~7v1 CA 


U-00o44o-04 


GAACCTCCATCCCACCTAT 


1556 














TLIDD 


NM U004o1 


108351 22 


D-003447-01 


GAATGTCGtJl I lAAGAAAT 


1557 


Tunn 

1 HKd 


NM 000461 


1 08351 22 


D-003447-02 


GAACAGTCGTCGCCACATC 


1558 


THRR 

i M IAD 


mm nnn^ifii 


I UOOO I /lZ 






•i ceo 


THRB 


NM 000461 


10835122 


D-003447-04 


GTGGAAAGGTTGACTTGGA 


1560 














VDR 


NM 000376 


4507882 


D-003448-01 


TGAAGAAGCTGAACTTGCA 


1561 


VDR 


NM 000376 


4507882 


D-003448-02 


G C AACCAAG ACTAC AAGTA 


1562 


VDR 


NM 000376 


4507882 


D-003448-03 


TCAATGCTATGACCTGTGA 


1563 


VDR 


NM 000376 


4507882 


D-003448-04 


CCATTGAGGTCATCATGTT 


1564 



Table X 



Gene 

Symbol 

ABCB1 



ABCC1 



ABCG2 



KCNH2 



KCNH1 



CLCA1 



<iffe Sense* ; m «jp 


:.• - SEQ¥;ID NO ^ 

- m ' 


GACCAUAAAUGUAAGGUUU 


1565 


UAGAAGAUCUGAUGUCAAA 


1566 


GAAAUGUUCACUUCAGUUA 


1567 


GAAGAUCGCUACUGAAGCA 


1568 






- ,. : . . I'll 


GGAAGCAACUGCAGAGACA 


1569 


GAUGACACCUCUCAACAAA 


1570 


UAAAGUUGCUCAUCAAGUU 


1571 


CAACGAGUCUGCCGAAGGA 


1572 




- Sense £v ' 




GCAGAUGCCUUCUUCGUUA 


1573 


AGGCAAAUCUUCGUUAUUA 


1574 


GGGAAGAAAUCUGGUCUAA 


1575 


UG ACUC AUCCC AACAU U UA 


1576 








CCGACGUGCUGCCUGAGUA 


1577 


GAGAAGAGCAGCGACACUU 


1578 


GAUCAUAGCACCUAAGAUA 


I 1579 


GCUAUUUACUGCUCUUAUU 


| 1580 


UCACUGGGCUCCUUUAAUU 


1581 


GUGCGAGCCUUCUGAAUAU 


1582 


GCUAAGCUAUACUACUGUA 


I 1583 


UGACGGCGCUCUACUUCAC 


I 1584 




Sense -J - : 




GAGAUGAAUUCCUUUGAAA 


\ 1585 


GAAGAACGCAUGAAACGAA 


! 1586 


GAUAAAGACACGAUUGAAA 


i 1 587 ~i 


GCUGAGAGGUCUAUUUAAA 


| 1 588 | 




Sense 




GAACAACAAUGGCUAUGAA 


1589 


GUACAUACCUGGCUGGAUU 


1590 
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SLC6A1 



SLC6A2 



GAACAGCUCACAAGUAUAU 


1591 


GGAAACGUGUGUCUAUAUU 


1592 I 




Sense 4- .'• 




G GAG G UGGG AG GACAG U U A 


1593 


| UCACAGCCCUGGUGGAUGA 


1594 


GAAGCUGGCUCCUAUGUUC 


1595 


GGUCAACACUACCAACAUG 


1596 




■ v-:; Sense y< :.. 




GAACACAAGGUCAACAUUG 


1597 


AGAAGGAGCUGGCCUAGUG 


1598 


CGGAAACUCUUCACAUUUG 


1599 


CAAC AAAU U UG AC AAC AAC 


1600 



SLC21A2 



. :.;m i < , Sense:-- 1 . ' -.^^ 


$ ><\ ■ 


GUACAUCUCCAUCUUAUUU 


1601 


GGAAGUGGCUGAGUUAUUA 


1602 


! GAAGGGAGGCUCAAUGUAA 


1603 


i GAAGGAAGUGGCUGAGUUA 


1604 I 



SLC21A3 



-~ " ; /; Sense IfWV''. 




* '..Al^. . * * * 
' \ (r***' * + 

" *► t . 1 - \ 


G U AG AAAC AGG AGCU AU U A 




1605 


CAAGAUUACUGUCAAACAA 




1606 


GCACAAGAGUAUUUGGUAA 


i 


1607 


GCAAAUGUCCCUUCUGUAU 


! 

1 


1608 


GCAUGACUCCUAUAUAAUA 


1 


1609 


AAACAGCAAUUUCCCUUAA 




1610 


GAAAAUGCCUCUUCAGGAA 


— . ~ t - 

i 

1 


1611 



SLC28A1 



•.<; • ■ . Sense - :-'^Ktr 1 


■ '.o? :. 


GUUCAUCGCUCUCCUCUUU 


" 1612 


GGAUCAAGCUGUUUCUGAA - ~r 


1613 


GGACUGCAGUUUGUACUUG 


1614 


GAGUGAAACUGACCUAUGG 


1615 



SLC29A1 



if''' - : Sensed ;r-^-; 


; . ■ v 

>-■■.•.:<- * ■ : ■ 


GAACGCUGCUCCCGUGGAA 


161 


6 


GAAAGCCACUCUAUCAAAG 


161 


7 


GAAACCAGGUGCCUUCAGA 


161 


8 


CCUCACAGCUGUAUUCAUG 


161 


9 



SLC26A1 



SLC26A2 



Sense- . ;r • \) 




j CCACGGAGCUGCUGGUCAU 


1620 


GGGUUGACAUCUUAUUUGA 


1621 


GCACGAGGGUCUCUGUGUU 


1622 


GGCCAUCGCCUACUCAU UG 


1623 


CAACACCCAUGGCAAUUAA 


1624 


GAGGAAAGAUCUUGCUGAU 


1625 


GAGCAAGCGUCCUCCAAAU 


1626 


GCAACACCCAUGGCAAUUA 


1627 




/- Sense * - 




CCAAAG AAC U CAAU G AAC A 


1628 


ACAAGAACCUUCAGACUAA 


1629 


GAAGGUAGAUAGAAGAAUG 


1630 
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SLC4A4 



GLRA1 



j GUAUUGAACUGUACUGUAA 


1631 




■ " *' • Jfi\ Sense " v \, . 


i . f * . , ' 


GCAAUUCUCUUCAUUUAUC 


1632 


GGAAAGAUGUCCACUGAAA 


1633 


GGACAAAGCCUUCUUCAAU 


1634 


GGAAUGGGAUCCAGCAAUU 


1635 




Sense 


' ; . 

-I 


UGAAAGCCAUUGACAUUUG 


1636 


CAGACACGCUGGAGUUUAA 


1637 


CAAUAGCGCUUUCUGGUUU 


1638 


GCAGGUAGCAGAUGGACUA 


1639 



KLK1 



. l . * ; Sesnse ' V - 






UCAGAGUGCUGUCUUAUGU 




1640 


CAACUUGUUUGACGACGAA 


! 


1641 I 


UGACAGAGCCUGCUGAUAC 




1642 


AGGCGGCUCUGUACCAUUU 




1643 



ADAM2 



XPNPE 
PI 



GZMA 



CMKLR1 



CLN3 



CALCR 



. • Sense . . 




GAAACAUGCUGUGAUAUUG 


1644 


GCAGAUGUUUCCUUAUAUA 


1645 


CAACAGAGAUGCCAUGAUA 


1646 


GAAAGGCGCUACAUUGAGA 


1647 




• -'V-a ^Sense ' J% 

. £ .'.->: ' ' ■ i 4 ; - * i' 

>..;'/ V ; .JL . • - 


- ■ V^ 1 - "J - v 
^Siivw^ . . *■.■*&:-'* 


GACCUGAGCUUCCCAACAA 


1648 


GCGACUGGCUCAACAAUUA 1 1649 


GAGAUUGCGUGGCUAUUUA | 1650 


GACAGCAACUGGACACUUA | 1651 


■ 

- t ' 


h ■ "^iSense : • ■-" ; . . 




GGAAGAGACUCGUGCAAUG 


1652 


GGAACCAUGUGCCAAGUUG 


1653 


GAAGUAACUCCUCAUUCAA 


1654 


GAACUCCUAUAGAUUUCUG "° 


1655 




-vkT'- -i&Sense' ' *■ ■ 




CAUAGAAGCUUUACCAAGA 


1656 


IGAAUGGAGGAUGAAGAUUA 


1657 


GGUCAAUGCUCUAAGUGAA 


1658 


GAGAGGACUUCUAUGAAUG 


1659 




' ; 1 Sense './ 


it * ' - 


CAUCAUGCCUUCUGAAUAA 


1660 


CAACAGCUCAUCACGAUUU 


1661 


GCAACAACUUCUCUUAUGU 


1662 


GGUCUUCGCUAGCAUCUCA 


1663 




. Sense 




GGACCUAGCUGUUGUAAAG 


1664 


G AAAGACC AU GC AU U U AAA 


1665 


GCAGGAAGAUGUAUGCUUU 


1666 


G AAU AAACC AG U AU CG U UA 


1667 
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OXTR 



EDG4 



EDG5 



EDG7 



PTCH 



SMO 



CASP3 



CASP6 



CASP7 



* Sense^ 




GGACCCAGAUAUCCAAAUA 


1668 


GCAAUACUAUCCUAACUGA 


1669 


G AAU AU AG AU UAGCGUUUG 


1670 


GAUGAGGCAUGACUACUAA I 


1671 




; ; Sense .' . / 




GCGAGUCUGUCCACUAUAC 


1672 


GAGAACGGCCACCCACUGA 


1673 


| GAACGGCCACCCACUGAUG 


1674 


GGUCAAUGCUGCUGUGUAC 


1675 




€ ' ' 'Mi :;Sensei| . 




UCCAGGAACACUAUAAUUA 


1676 


GUGACCAUCUUCUCCAUCA 


1677 


CAUCCUCUGUUGCGCCAUU 


1678 


CCAACAAGGUCCAGGAACA 


1679 




: ' . -Sense i, ■ '--\.' 1 


~ : ~ ' ' ^ y : *V ■ 

' "> ■ - ^ * ' i 


AC ACUG AU AC UG UCG AU G A 


1680 


AAUAGGAGCAACACUGAUA 


1681 | 


, CAGCAGGAGUUACCUUGUU 


1682 ! 


t GG ACACCCAU GAAG C U AAU 


1683 i 




. : ih . : *&Seri$g&: . .3H\ . • 


, : ■ n 


GCACAGAACUCCACUCAAA 


1684 


GGACAGCAGUUCAUUGUUA 


1685 


GAGAAGAGGCUAUGUUUAA 


1686 | 


GGACAAACUUCGACCCUUU 


1687 | 




: S ' - % i: . *$ense r $ 




UCGCUACCCUGCUGUUAUU I : ? 1688 I 


GCUACAAGAACUACCGAUA 


i ™" 1 689 | 


CAAGAAAGCUUCCUUCAAC 


I 1 690 


GAGAAGAAAUACAGUCAAU 


1691 




: : - I'l'l / "il S'ense .. u- : 


■ ■ * v * . r^i " * 


CAAUAUAUCUGAAGAGCUA 


1692 


GAACUGGACUGUGGCAUUG 


1693 I 


GUGAGAAGAUGGUAUAUUU ! 1694 


GAGGGUACUUUAAGACAUA 


1 1695 




'"- . t Sense ■ \; ' * ". -i:. 


CAUGAGGUGUCAACUGUUA 


j 1696 


GAAGUGAAAUGCUUUAAUG I 1697 


AAAUAUGGCUCCUCCUUAG 


1698 


GCAAUCACAUUUAUGCAUA 


1699 | 


CAACAUAACUGAGGUGGAU 


I 1700 


C AU GG U AC AU U C AAG AU U U 


I 1701 


* 


:■ Sense: * . : . 




GAACUCUACUUCAGUCAAU 


1704 


GGGCAAAUGCAUCAUAAUA 


1703 


CAACAGAGGGAGUUUAAUA 


1704 


GAACAAAGCCACUGACUGA 


1705 



WO 2004/045543 
CASP8 
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CASP9 



DVL1 



DVL2 



PTEN 



PDK1 



PDK2 



PPP2CA 



CTNNA1 



133 


y^ w- jr ?_ fi ;v Sense or , \ . * v 


i * ... ■ -.^*** 


GAAG U G AAC U AUG MG U AA 


i 1706 


CAACAAGGAUGACAAGAAA 


1 1707 


GGACAAAGUUUACCAAAUG 


! 1708 


GAGGGUCGAUCAUCUAUUA 


■ 1709 


GAAUAUAGAGGGCUUAUGA 


' 1710 


CAACGACUAUGAAGAAUUC 


■ 1711 i 


GAAGUGAGCAGAUCAGAAU 


1712 


GAGGAAAUCUCCAAAUGCA 


1713 




. •■V : ■ t Sense .y '. \ "■ 




CCAGGCAGCUGAUCAUAGA 


1714 * J 


UCUCAGGUGUUGCCAAAUA 


1715 


GAACAGCUGUAAUCUAUGA 


1716 


CCACUGGUCUGUAGGGAUU 


\ 1717 




i nS' • Bense ■ : ' # \ ' i 


lA-; - ill.* . 
~~ ' : -V 'fU 


UCGUAAAGCUGUUGAUAUC 


1718 


GAGGAGAUCUUUGAUGACA 


1719 


GUAAAGCUGUUGAUAUCGA 1 


1720 


GAUCGUAAAGCUGUUGAUA 


1721 




v ,: * .•■* Sense' 'Si- *%.\ 


^ - -k- v. 
:. ? 


AG ACG AAG G U G A U U U ACC A j 1799 


UGUGAGAGCUACCUAGUCA 


1723 


GAAGAAAUUUCAGAUGACA 


1724 


UAAUAGGCAUUUCCUCUUU 


1725 




' ft % t Sense, % '^^VK 


' ^ : £\ - '> : *> ' 


GUGAAGAUCUUGACCAAUG 


1726 


GAUCAGCAUACACAAAUUA 


1727 


GAAUGAACCUUCUGCAACA | 


1728 


GGCGCUAUGUGUAUUAUUA I 


1729 



^ : %• ,'^Sense^. 


*' 

*V * : ■ " 

■ . . x- . . •-. 


GUACAAAGCUGGUAUAUCC 


1730 


GAAAGACUCCCAGUGUAUA 


1731 


GGAAGUCCAUCUCAUCGAA 


1 732 


CCAAAGACAUGACGACG UU 


1733 








GUAAAGAGGAGACUGAAUG 


1734 


J^GUCUGUGAUGGUCCCUAA 


1735 


I CAAAGAUGCCUACGACAUG 


1736 


| GGGCGAUGCCUGAGGGUUA j 1737 




'. Sense •'■ p ■ !.' 




UCACACAAGUUUAUGGUUU 


1738 


CAAC AGCCG UG ACC ACU 0 U 


1739 


UAACCAAGCUGCAAUCAUG 


1740 


GAACUUGACGAUACUCUAA 


1741 




Sense 




| GAAGAGAGGUCGUUCUAAG 


1742 


i AAGCAGAUGUGCAUGAUUA 


1743 


UCUAAUAACUGCAGUGUUU 


1744 ~| 
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GUAAAGGGCCCUCUAAUAA 



JZ45 | 



CTNNA2 



HSPCA 



DCTN2 



CD2 



BAD 



SMAC 



' .' ^.'^ Sense ? * 




GAAAGAAUAUGCCCAAGUU 


1746 


GAAGAAGAAUGCCACAAUG 


1747 


GCAGGAAGAUUAUGAUGUG 


1748 


AAAGAAAGCCCAUGUACUA 


1749 




1 Sense i ^ : • ■ 




GGGAAAGAGCUGCAUAUUA 


1750 


GCUUAGAACUCUUUACUGA 


1751 


UAUAAGAGCUUGACCAAUG 


1752 


GCAGAUAUCUCUAUGAUUG 


1753 




. 5\i Sense C. ; . 


* * ■ r 


CAACUCAUGUCCAAUACUG 


1754 


GGAAUGAGCCAGAUGUUUA 


1755 


GGAGACAGCUGUACGUUGU 


1756-' 


UCCAAGAGCUGACAACUGA 


1757 








GUAAGGAGAAGCAAUAUAA 


1758 


AAG AU G AG C U U U CC AUG U A 


• 1759 


GG AC AUG U AU CU CAUCAU U 


1760 


GACAAGAGCCCACAGAGUA - 


1761 




- ' -W^ Sense ; ■ 5 




GUACUUCCCUCAGGCCUAU 


1762 


GCUGUGCCUUGACUACGUA 


1763 


GUACUUCCCUCAGGCCUAU 


1764 


GGUCAGGUGCCUCGAGAUC | 1765 




; - ;r tSense ' \ '^-y/' ' ' : ' 


: -y^;hA- • * V.--&V: ■ 


CAGCG U AAC U U CAU UCU UC 


1766 


UAACUUCAUUCUUCAGGUA 


1767 


CAGCUGCUCUUACCCAUUU 


1768 


GAUUGAAGCUAUUACUGAA 


1769 


UAGAAGAGCUCCGUCAGAA 


* 1770 


CCACAUAUGCGUUGAUUGA 


1771 


GCGCAGGGCUCUCUACCUA 


1772 



MAP3K5 



Sense' ; - -?\ 




GAACAGCCUUCAAAUCAAA 


1773 


GAUGUUCUCUACUAUGUUA 


1774 


GCAAAUACUGGAAGGAUUA 


1775 


CAGGAAAGCUCGUAAUUUA 


1776 



PVR 



ERBB2 



Sense . 




CCACACGGCUGACCUCAUA 


1777 


CAGCAGAAUUCCUCUUAUA 


1778 


G C AG A AU U CC U C U U AU AAA 


1779 


GAUCGGGAUUUAUUUCUAU 


1780 




* Sense .:<•: 




UGUGGGAGCUGAUGACUUU 


1781 


UCACAGAGAUCUUGAAAGG 


I 1782 
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SOS1 



BRCA1 



CDKN1A 



UGGAAGAGAUCACAGGUUA ! 1783 


GCUCAUCGCUCACAACCAA | 1784 




. :Sense ' <y : i 


> 


GAGCACCACUUCUAUGAUU 


1785 


CAAAGAAGCUGUUCAAUAU 


1786 


UGAAAGCCCUCCCUUAUUA 


1787 


GAAAUAGCAUGGAGAAGGA 


1788 




• - ■ ^KSfef 1 Sense :'. ■ #fe. 




CCAUACAGCUUCAUAAAUA 


1789 • 


GAAGAGAACUUAUCUAGUG 


1790 


GAAGUGGGCUCCAGUAUUA 


1791 


GCAAGAUGCUGAUUCAUUA 


1792 


GAAGUGGGCUCCAGUAUUA 


1793 


GAACGGACACUGAAAUAUU 


1794 


GCAGAUAGUUCUACCAGUA 


1795 






h^mu . 


GAACAAGGAG.UCAGACAUU 


1796 


AAACUAGGCGGUUGAAUGA 


1797 


GAUGGAACUUCGACUUUGU 


1798 


GUAAACAGAUGGCACUUUG 


1799 



CDKN1B 





&s>;> ■ ■ «f?te 


GGAAUGGACAUCCUGUAUA 


1800 


GGAGAAAGAUGUCAAACGU 


1801 


GAAUGGACAUCCUGUAUAA 


1802 


GUAAACAGCUCGAAUUAAG 


1803 



SLC2A4 



N0S2A 



FRAP1 



FKBP1A 



Sense ^ -. \ 


i ■ . i i' .1?.'. 


CAGAUAGGCUCCGAAGAUG 


1804 


AGACUCAGCUCCAGAAUAC 


1805 ? 


GAUCGGUUCUUUCAUCUUC 


1806 


CAGGAUCGGUUCUUUCAUC 


1807 




' r ' Sense ; .B 7 * tl, ,:. 


: . Vv • ,-. . 


CCAGAUAAGUGACAUAAGU 


1808 


UAAGUGACCUGCUUUGUAA 


1809 


GAAGAGAGAUUCCAUUGAA 


1810 


UGAAAGAGCUCAACAACAA 


1811 




.,XS: ' Sense : ■ 


' '••V'..': 


GAGCAUGCCGUCAAUAAUA 


1812 


CAAGAGAACUCAUCAUAAG 


1813 


CCAAAGUGCUGCAGUACUA 


1814 


UAAGAAAGCUAUCCAGAUU 


1815 




Sense 


f * \ ■ - r 


G AAAC AAGCCC U U UAAG U U 


1816 


GAAUUACUCUCCAAGUUGA 


1817 


CAGCACAAGUGGUAGGUUA 


1818 


GUUGAGGACUGAAUUACUC 


1819 


GAUGGCAGCUGUUUAAAUG 


1820 


GAGUAUCCUUUCAGUGUUA 


1821 
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TNFRSF 
1A 



IL1R1 



IRAKI 



TRAF2 



TRAF6 



TRADD 



FADD 







CAAAGGAACCUACUUGUAC 


1822 


j GGAACCUACUUGUACAAUG 


1823 


! GAACCUACUUGUACAAUGA 


1824 


1 GAGUGUGUCUCCUGUAGUA 


1825 




- ' ■ ■. Sense • 




GGACAAGAAUCAAUGGAUA 


1826 


GAACAAGCCUCCAGGAUUC 


1827 


GGACUUGUGUGCCCUUAUA 


1828 


GAACACAAAGGCACUAUAA 


1829 




? if:. ?Sense \f '<M 




CGAAGAAAGUGAUGAAUUU 


1830 


GCUCUUUGCCCAUCUCUUU 


1831 


UGAAAGACCUGGUGGAAGA 


1832 


GCAAUUCAGUUUCUACAUC 


1833 




■£''"■4 ■ : W^: " S]bnse r -j ; ' 


. *■ las . 


G AAG ACAG AG U U AU UAAAC 


1834 


UCACGAAGACAGAGUUAUU 


1835 


AG AC AG AG U U AU UAAAC CA 


1836 


CACGAAGACAGAGUUAUUA 


1837 


GCUGAAGCCUGUCUGAUGU 


1838 




- I-**-: " ; l$ense. xk. -il&f .ill'-: 


■ T." »- * 

- . ; • ' .if* 

. ■ - - . /".vj 


CAAAUGAUCUGAGGCAGUU 


1839 


GUUCAUAGUUUGAGCGUUA 


1840 


GGAGAAACCUGUUGUGAUU 


1841 


GGACAAAG U UGCUG AAAUC 


1842 


1 CAAAUGAUCUGAGGCAGUU 


1843 


GGAGAAACCUGUUGUGAUU | 1844 


GGACAAAGUUGCUGAAAUC j 1845 


GUUCAUAGUUUGAGCGUUA j 1846 




' ~ : 1 z - : -;f ; Sense |i ; ■ - %■> 




UGAAGCACCUUGAUCUUUG 


1847 - 


GGGCAGCGCAUACCUGUUU 


1848 


GAGGAGCGCUGUUUGAGUU 


1849 


GG ACGAGGAGCGCUG U U UG 


1850 


GAGGAGCGCUGUUUGAGUU 


1851 


GGAUGUCUCUCUCCUCUUU 


1852 


GCUCACUCCUUUCUACUAA 


1853 


] UGAAGCACCUUGAUCUUUG 


1854 




Sensei: -vi* 




GCACAGAUAUUUCCAUUUC | 1855 


GCAGUCCUCUUAUUCCUAA 


1856 


GAACUCAAGCUGCGUUUAU 


1857 


GGACGAAUUGAGAUAAUAU 


1858 



IKBKE 



Sense 



UAAGAACACUGCUCAUGAA 



GAGGCAUCCUGAAGCAUUA 



GAAGGCGGCUGCAGAACUG 



1 859 
1 860 



1861 
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GGAACAAGGAGAUCAUGUA 



1862 ~] 



IKBKG 



TNFRSF 
5 



RELA 



ARHA 



CDC42 



ROCK1 



PAK1 



PAK2 



,.->'•- Sense 




CUAUCGAGGUCGUUAAAUU 


1863 




GAAUGCAGCUGGAAGAUCU 


1864 




GCGGCGAGCUGGACUGUUU 


1865 




CCAGACCGAUGUGUAUUUA 


1866 


. .. . . ~i 





— „ ., . . -i 


"Sense ^ ; J 




GGUCUCACCUCGCUAUGGU 


1867 


GAAAGCGAAUUCCUAGACA 


1868 


. G C ACAAAC A AG AC U GAL) G U 


1869 


G AAGG GCACC UC AG AAACA 


1870 


UCUCCCAACUUGUAUUAAA 


1871 




It Sense , . 


£ V 


UCAAGUGUCUUCCAUCAUG 


1872 


UCAAGUGCCUUAAUAGUAG 


1873 


GGAGUACCCUGAGGCUAUA 


1874 


I GAUGAGAUCUUCCUACUGU 


1875 




■ .. ^i: . ?. ■ Cerise * ■ \ 'gf' - 


. - . *i< 


GAGCUGGGCUAAGUAAAUA 


1876 


GACCAAAGAUGGAGUGAGA 


1877 


GGAAGAAACUGGUGAUUGU 


1878 


GGCUGUAACUACUUUAUAA 


1879 




"-to Sense ./% 'M- 


. & "■' -'.-x , 


GGACAUUUGUUUGCCAUUU 


1880 


GG AGAACCAUAUACUCU UG 


1881 


GAACCAAUGCUUUCUCAUG 


1882 


GAAGACCUG U U AUG U AG AG 


1883 


GAUCAAGAAUUGCAAUAUC 


1884 


GAAAAGGGGUGACCUAGUA 


1885 


UGACAAACCUUAUGGAAAA 


1886 




1 : M- ' - Sense - 'ft ?' 


■ ■ \ ' <; 

V ; s*. ■ it 


1 GGAAUGAGCUUCAGAUGCA 


1887 


GGACACAGCUGUAAGAUUG 


1888 


G ACAAG AG AU U ACAG AU AA 


1889 


GAAGAAACAUUCCCUAUUC 


1890 




' "H- r; z. Sense 




GAGGG U G G U U U AU G AU U AA 


1891 


CAACAAAG AAC AAU CAC UA 


1892 


GAAG AAAU AU AC ACGG U U U 


I 1893 


UACAUGAGCUUUACAGAUA 


I 1894 




Sense 




GGUAGGAGAUGAAUUGUUU 


1895 


AGAAGGAACUGAUCAUUAA 


1896 | 


CUACAGACCUCCAAUAUCA 


1897 ! 


GAAAC U G G CCAAACCG U U A 


1898 I 



PAK3 



Sense 
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1 GAUUAUCGCUGCAAAGGAA 
GAGAGUGCCUGCAAGCUUU 


.. _. 1 




1899 
1900 


GACAAGAGGUGGCCAUAAA 






1901 


UUAAAUCGCUGUCUUGAGA 






1902 




: \ ? • ' Sense 






ACUAAGAGGUGAACAUGUA 






1903 


GAUCAUGAAUGUCCGAAGA 




1904 


GAUGAGACCCUACUACUGA 






1905 


CAGCAAAGGUGCCAAAGAU 






1906 



PAK6 



. ' Sense;- 


ey*: ' 

■i-<\ . 




UAAAGGCAGUUGUCCACUA 


1907 


GAAGGGACCUGCUUUCUUG 


1908 


GCAAAGACGUCCCUAAGAG 


1909 


CCAAUGGGCUGGCUGCAAA 




1910 



PAK7 





¥ t ' . : «■■ <■■£ 

,« ' ■ <p . * * * 


GAGCACGGCUUUAAUAAGU 


1911 


CAAACUCCGUUAUGAUAUA 


1912 


GGAUAAAGUUGUCUGAUUU 


1913 


GGAAAUGCCUCCAUAAAUA 


1914 



HDAC1 



:, ^■wm.v. Sense - - : 


- -'. : - 


GGACAUCGCUGUGAAUUGG 


1915 


AGAAAGAAGUCACCGAAGA 


1916 


GGACAAGGCCACCCAAUGA 


1917 


CCACAGCGAUGACUACAUU 


1918 



HDAC2 



" ('Mil Sense \. ' "r 


\t ' - t 1 "~'r. "%2- — V IJ '.' 

V'\ • . 


GCUGUUAAAUUAUGGCUUA 


1919 


GCAAAGAAAGCUAGAAUUG 


1920 


CAUCAGAGAGUCUUAUAUA 


1921 


CCAAUGAGUUGCCAUAUAA 


1922 



CREBBP 



- Sense ; . Mr-- 




GGCCAUAGCUUAAUUAAUC 


1923 


GCACAGCCGUU.UACCAUGA 


1924 


GGACAGCCCUUUAGUC'AAG 


1925 


GAACUGAUUCCUGAAAUAA 


1926 



BTRC 



-ih ^ Sense . . 






CACAUAAACUCGUAUCUUA 




1927 


G AG AAGG C AC U C A AG U U U A 




1928 


I AG AC AU AG U U U AC AG AG AA 




1929 


GCAGAGAGAUUUCAUAACU 




1930 



RIPK2 



VAV1 



Sense- 




GAACAUACCUGUAAAUCAU 


1931 


GGACAUCGACCUGUUAUUA 


1932 


UAAAUGAACUCCUACAUAG 


1933 


GGAAUUAUCUCUGAACAUA 


1.934 




Sense 




GCAGAAAUACAUCUACUAA 


1935 


GCUAUGAGCUGUUCUUCAA 


1936 
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VAV2 



GRB2 



PLCG1 



ITGB1 



ITGA4 



STAT1 



KRAS2 



CGACAAAGCUCUACUCAUC 


1937 I 


GCUCAACCCUGG AG ACAU U 


1938 i 




« j," Sense.. ; 




GGACAAGACUCGCAGAUUU 


1939 


GCUGAGCGCUUUGCAAUAA 


1940 


CAAGAAGUCUCACGGGAAA 


1941 


UCACAGAGGCCAAGAAAUU 


1942 




:P' <w :/ /Sense:/ % 


"v- . ' : - 


UGGAAGCCAUCGCCAAAUA 


1942 


CAUCAG U GCAUGACG U U U A 


1943 


UGAAUGAGCUGGUGGAUUA 


1944 


U GCCAAAAC U U ACC U AU AA 


1945 




* . * W:- ' % i Sense : v£ - 




GAGCUGCACUCCAAUGAGA 


1946 


GAAACCAAGCCAUUAAUGA 


1 947 


CCAAGGAGCUACUGACAUU 


1948 


AGAGAAACAUGGCCCAAUA ~ | 1 949 




" %> ■ : ' -jH Sense • ; • : " f # : ;- 


*■ m.-. ' i ■ , ' 


CCAC AG AC AU U U AC AU U AA 


1950 


GAAGGGAGUUUGCUAAAUU 


1951 


GAACAGAUCUGAUGAAUGA 


1952 


CAAGAGAGCUGAAGACUAU 


1953 




h: vSerisS; r- '. 


,m. , ]. '.'-^ 


GCAUAUAUAUUCAGCAUUG 


1954 


CAACUUGACUGCAGUAUUG 


1955 


GAACUUAACUUUCCAUGUU 


1956 


GACAAGACCUGUAGUAAUU j 1957 




Si; ' ' 3 fir Sense : 


"> : .j .; . 


AGAAAGAGCUUGACAGUAA | 1 958 


GGAAGUAGUUCACAAAAUA | 1959 


UGAAGUAUCUGUAUCCAAA | 1960 


GAGCUUCACUCCCU.UAGUU 1961 




-. - • ' ' '■ fSenseiK • - ;V; ' 




UAAGGACUCUGAAGAUGUA 


1962 


GACAAAGUGUGUAAUUAUG 


1963 


GCUCAGGACUUAGCAAGAA 


1964 


GAAACUGAAUACCUAAGAU 


1965 


GAAACUGAAU ACCUAAGAU I 1 966 


UAAGGACUCUGAAGAUGUA 


1967 


GACAAAGUGUGUAAUUAUG 


1968 


G C UCAGG AC U U AG C AAG AA 


1969 



HRAS 


Sense 






CCAUCCAGCUGAUCCAGAA 


1970 




GAACCCUCCUGAUGAGAGU 


1971 




GAGGACAUCCACCAGUACA 


1972 



BRAF 



Sense 


1 1 


GAUUAGAGACCAAGGAUUU 
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ELK1 



RALGDS 



PRKCA 



MAP2K4 



MAP2K7 



MAPK8 



MAPK9 



AIF1 



CCACUGAUGUGUGUUAAUU 




CAAUAGAACCUGUCAAUAU 


1974 


GAAGACAGGAAUCGAAUGA 


1975 




• * 1 Sense \ v 't- 




-.>•;" ' ,-' ■ * 
\ : -> ,v 


G AUG UG AG U AG AAG AG U U A 


1975 


GGAAGAAUUUGUACCAUUU 


1976 


GAACGACCUUUCUUUCUUU 


1977 


GGAGUCAUCUCUUCCUAUA 


1978 




"p Sense M ■ Jl 


- i.r ■ t , ff 

■ . i- ■:■ 


GGAGAAGCCUCACCUCUUG 






1979 


GCAGAAAGGACUCAAGAUU 


i 




1980 


GAGAACAACUACUCAUUGA 






1981 


GAACUUCUCGUCACUGUAU 


1982 




C ' ISense - W':- 






^ * . ,«-.■■'■ * - ■ 


GGAUUGUUCUUUCUUCAUA 


_ _ 




1983 


GAAGGGUUCUCGUAUGUCA 


1984 


GAAGAAGGAUGUGGUGAUU 


1985 


GGACUGGGAUCGAACAACA 


1986 






: r ■ • "t; V Serile: m 


\ „ * - i 

• ' v : 




..-*3g^ • '.••■ft? 


GGACAGAAGUGGAAAUAUU 




1987 


UCAAAGAGGUGAACAUUAA 




1988~ 


GACCAAAUCUCAG U UG U U U 


1989 


GGAGAAUGGUGCUGUUUAA 






1990 




■ : .SenseU... ' S • ' -<lf 


""■»-* » . i'A't 


G AAG AG ACC AAAG U AU AAU 


1991 


GAAGACCGGCCACGUCAUU 


1992 


GG AAG AG ACC AAAG UAUAA 


1993 


GCAUUGAGAUUGACCAGAA 


1994 


UGAGAGAACGAGAAAGUUG 






1995 


GUGAAACCCUGUCUGCAUU 






1996 


GGAUCUCUCUCAACAACUA 






1997 


ACAACU AGG UG AAC ACAU A 


■ ■ ■ 1 ...... 


i 


1998 




■ ^t Sei^Se- , ;" tr .-" 




UCACAGUCCUGAAACGAUA 


1999 


GAUUGGAGAUUCUACAUUC 




2000 


GCUCAUGGAUGCAAAUCUU 






2001 


G AAG C U AAG CC G ACC AU U U 







2002 




,. : / -S. Sense 




AAAGAGAGCUUAUCGUGAA 


2003 


G AUG AU AGG U UAGAAAUAG 


2004 


ACAAAGAAGUCAUGGAUUG 


i 
1 


2005 


GGAGCUGGAUCAUGAAAGA 




i 


2006 




Sense 




GAAAAGGGAUGAUGGGAUU 


2007 


CCUAGACGAUCCCAAAUAU 


2008 


GAGCCAAACCAGGGAUUUA 






2009 


UGAAACGAAUGCUGGAGAA 




t_ .. . 


2010 


UCACUCACCCAGAGAAAUA 


1 


2011 
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BBC3 



BCL2L1 



BCL2L1 1 



BID 



BIRC2 



BIRC3 



BIRC4 



BIRC5 



CCAAGAAAGCUAUCUCUGA j 


2012 


AGACUCACCUAGAGCUAAA | 


2013 




■ *.<:■'"*- Sensei \ 




CCUGGAGGGUCCUGUACAA 


2014 


GAGCAAAUGAGCCAAACGU | 


2015 


GGAGGGUCCUGUACAAUCU ! 


2016 


GACUUUCUCUGCACCAUGU I 


2017 




.--#'*. Sense" - : J. '.[: ' \ 


>■ ". > * * ' — ' ' * * 


CCAGGGAGCUUGAAAGUUU | 


2018 


AAAGUGCAG U UC AG UAAUA | 


2019 


GAGAAUCACUAACCAGAGA \ 


2020 


G AGCCCAU CCC U AU U AU AA | 


2021 








G AG ACG AG U U UAACGCU U A 


2022 


AAAGCAACCUUCUGAUGUA 


2023 


CCGAGAAGGUAGACAAUUG 


2024 


GCAAAGCAACCUUCUGAUG 


2025 


AGACAGAGCCACAAGGUAA 


2026 


GCAAGG AG G U U AG AG AAAU 


2027 


CAAGGAGGUUAGAGAAAUA 


2028 


UCUUACGACUGUUACGUUA 


2029 | 




•'; " SenseV-sJ 


* *i - ^ 
- ,V ■ 1 


GAAGACAUCAUCCGGAAUA 


2030 


CAACAGCG U U C C U AG AG AA 


2031 


GAAAUGGGAUGGACUGAAC 


2032 


ACGAUGAGCUGCAGACUGA 


! 2033 . 




• K . - ~ : ^Sense ' "Vim ■ i 


V'*'- T " - . * * 


GAAAGAAGCCUGCAUAUAA 


2034 


GAAAUUGACUCUACAUUGU 


2035 


ACAAAUAGCACUUAGGUUA 


! 2036 


GAAUACACCUGUGGUUAAA 


! 2037 






■ ■•->:-,> ' .• a - . V, 


GGAGAUGCCUGCCAUUAAA 


2038 


UCAAUGAUCUUGUGUUAGA 


2039 ! 


G AAAG AAC AU G U AAAG UG U 


I 2040 


GAAGAAAG AACAUG UAAAG 


! 2041 




I M • Sense 


•i- - . -1. 


GUAGAUAGAUGGCAAUAUG 


2042 


GAGGAGGGCUAACUGAUUG 


2043 


| GAGGAACCCUGCCAUGUAU 


2044 


i GCACGGAUCUUUACUUUUG 


2045 




Sense : 




GGCGUAAGAUGAUGGAUUU 


2046 


GCAAAGGAAACCAACAAUA 


2047 


GCACAAAGCCAUUCUAAGU 


2048 


CAAAGGAAACCAACAAUAA 


I 2049 



BRCA1 



Sense 
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CCAUACAGCUUCAUAAAUA 


t 
i 


2050 . 


GAAGAGAACUUAUCUAGUG 




2051 


GAAGUGGGCUCCAGUAUUA 


i 

i 


2052 


GCAAGAUGCUGAUUCAUUA 


i 


2053 


CCAUACAGCUUCAUAAAUA 


i 


2054 



CARD4 



■i. • ' • ; Sense *: . \ \ -\H- 




GAAAGUUAAUGUCAAGGAA 


2055 


GAGCAACACUGGCAUAACA 


2056 


U AACAGAGAU U UGCCUAAA 


2057 


GCGAAGAGCUGACCAAAUA 


2058 



CASP10 



CASP2 



CDKN1A 



~ ' \-i '■ Sense ,■ -C if: 


> ■. 


CAAAGGGUUUCUCUGUUUA 


2059 


GAAAUGACCUCCCUAAGUU 


2060 


GAAGGCAGCUGGUAUAUUC \ 2061 


GACAUGAUCUUCCUUCUGA \ 2062 


GCACUCUUCUGUUCCCUUA | 2063 ] 




1; W:- ' Sense - . jg: 




GUAUUAAACUCUCCUUUGA 


2064 


GCAAGGAGAUGUCUGAAUA 


2065 


CAACUUCCCUGAUCUUUAA 


2066 


GCUCAAAGAUGUAAUGUAG 


2067 




fil : - Sensefl? ";' : f . ' 


" V^r , " V- ■ 


GAACAAGGAGUCAGACAUU 


2068 


AAACUAGGCGGUUGAAUGA 


2069 


GAUGGAACUUCGACUUUGU 


2070 


GUAAACAGAUGGCACUUUG 


2071 



CFLAR 



5'*' ' ; %£ ■ "'Sense • ;> ;;: <fe- >;B: 'V 




GAUGUGUCCUCAUUAAUUU 


2072 


G AAG AG AG AU AC AAG AU G A 


2073 


GAGCAUACCUGAAGAGAGA 


2074 


GCUAUGAAGUCCAGAAAUU 


2075 



CLK2 



CLSPN 



CSNK2A 
1 





: ■ ; ii ... 


GUGAAUAUGUGAAAUAGUG " 


2076 


AAAGCAUGCUAGAGUAUGA 


2077 


U UAAG AAUG UGG AG AAG U A 


2078 


GAUAACAAGCUGACACAUA 


2079 




1 ' Sense 




GGACGUAAUUGAUGAAGUA 


2080 


GCAGAUGGGUUCUUAAAUG 


2081 


CAAAUGAGGUUGAGGAAAU 


2082 


GGAAAUACCUGGAGGAUGA 


2083 




■Sense 




GAUCCACGUUUCAAUGAUA 


2084 


GCAUUUAGGUGGAGACUUC 


2085 


G AUG U ACG AU U AU AG U U U G 


! 2086 


UGAAUUAGAUCCACGUUUC 


! 2087 
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r .. Sense 


....... 


GCACAAGAAUGGAUCACAA 


2088 


GCUGAAACAUGCAGUUGUA 


2089 


GUACGUACCAUGCAGAAUA 


2090 


GAACUUGCAUUGUGAUUGG 


2091 



CXCR4 



./- : ')/ 4 .'Sense ,r . 




GAAGCAUGACGGACAAGUA 


2092 


GAACAUUCCAGAGCGUGUA 


2093 


GUUCUUAGUUGCUGUAUGU 


2094 


CAUCAUGGUUGGCCUUAUC 


2095 



CXCR6 



c-: Sense - * ; ; ,; > v sJr 


'T.fe-'i;.'!;-;^!^ ^ <. .5-..- 


GGAACAAACUGGCAAAGCA 


2096 


GAUCAGAGCAGCAGUGAAA 


2097 


GGGCAAAACUGAAUUAUAA 


2098 


GAUCUCAGGUUCUCCUUGA 


2099 i 



DAXX 



:Sf#- , Sense * 




C U AC AG AU CU CC AAU G AAA 


2100 


GCUACAAGCUGGAGAAUGA 


2101 


GGAAACAGCUAUGUGGAAA 


2102 


GGAGUUGGAUCUCUCAGAA 


2103 



GAS41 



GTSE1 



r Sense 


•■1 - ; ' ; 


GUAGUAAGCUAAACUGAAA 


2104 


GACAAUAUGUUCAAGAGAA 


2105 


GACAACAUCUCGUCAGCUA 




2106 


UAUAUGAUGUGUCCAGUAA 




2107 








CAAAGAAGCUCACUUACUG 


2108 


G AAC AG CCCU AAAG U GG U U 


2109 


GAACAUGGAUGACCCUAAG 


t 

! 

L 


2110 


GGGCAAAGCUAAAUCAAGU 


i- 
i 


2111 



HDAC3 



8T ; . ■ Sensed ' ; 




GGAAAGCGAUGUGG AGAU U 


2112. . 


CCAAGACCGUGGCCUAUUU 


2113 


AAAGCGAUGUGGAGAUUUA 


2114 


GUGAGGAGCUUCCCUAUAG 


2115 



HDAC5 



HEC 







GAAUUCCUCUUGUCGAAGU 


2116 


GUUAUUAGCACCUUUAAGA ; 


2117 


GGAGGGAGGCCAUGACUUG 


2118 


CAGGAGAGCUCAAGAAUGG 


2119 


GG AU AUGG AU U U CAG U U AA 


2120 


GGAAGUCGGUGCCUUGGUU 


2121 


GGAAGGAGAGGACUGGUUU 


2122 




Sense 




GCAGAUACUUGCACGGUUU 


2123 


GAGUAGAACUAGAAUGUGA 


2124 


GCGAAUAAAUCAUGAAAGA 


2125 


GAAGAUGGAAUUAUGCAUA 


! 2126 
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GGCAAUGCGUCUCGCGAUA 


2127 


GAUCCGCAAUGAUGAGGAA 


2128 


GCAAUGCGUCUCGCGAUAA 


2129 


GAGGAACUCAAUAAGCUUU 


2130 



LMNB1 



; /.Sense • : 


' ". 

■'.■■««.-'. - i ' ■ 

* < * t ' « " / 


AAUAGAAGCUGUGCAAUUA 


2131 


CAACUGACCUCAUCUGGAA 


2132 


GAAGGAAUCUGAUCUUAAU 


2133 


GGGAAGGGUUUCUCUAUUA 


2134 



LMNB2 



MYB 



MYT1 



' ••:?ii#f ; - - Sense. ■ '"y-'Si$?''"| 


-r. ■ '* S. M 


GGAGGUUCAUUGAGAAUUG | 


2134 


GGCAAUAGCUCACCGUUUA | 


2135 


CAAAUACGCUUAGCUGUGU | 


2136 


GGAGAUCGCCUACAAGUUC I 


2137 




' "V&g,.- . Sensed ' *. 




GCAGAAACACUCCAAUUUA 


2138 


GUAAAUACGUGAAUGCAUU 


2139 


GCACUGAACUUUUGAGAUA 


2140 


GAAGAACAGUCAUUUGAUG 


2141 




■W i,- • , Sense ; 1'-; " ; 




GAGGUGAGCUGUUAAAUCA 


2142 


GCAGGGUGAUUUCCUAAUA 


2143 


GGGAGAAGAUAUUUAAUUG 


! 2144 


CAACUUCUCUCCUGAACUU 


I - 2145 



NFKBIB 



:) - k^.. ■ Sensed J '. vg;..?.* ' ' 




GGACACGGCACUGCACUUG 


2146 


GCACUUGGCUGUGAUUCAU 


2148 


'GAGACGAGGGCGAUGAAUA 


2149 


CAUGAACCCUUCCUGGAUU 


2150 



NFKBIA 



■•if- ^ - Sense - . V £K- " ' 


- , ■ 


GAACAUGGACUUGUAUAUU 


2151 


GAUGUGGGGUGAAAAGUUA j 21 52 


GG ACG AG AAAG AUC AU U G A 


2153 


AGGACGAGCUGCCCUAUGA 


2154 



NFKBIE 



, VrC ' Sense % S ■ 


GAAGGGAAGUUUCAGUAAC 


i 


2155 


GGAAGGGAAGUUUCAGUAA 


i 


2156 


GGAAACUGCUGCUGUGUAC 




2157 


GAACCAACCACUCAUGGAA 

* i ■ ■" ~» 




2158 



NUMA1 



NUP153 



Sense ;. 




GGGAACAGUUUGAAUAUAA 


2159 


GCAG U AGCC UGAAG CAG AA 


2160 


CGAGAAGGAUGCACAGAUA 


2161 


| GCAAGAGGCUGAGAGGAAA 


2162 




Sense v. ? 


.-.*■*' . * ; " 

* * * 
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OPA1 



GAAGACAAAUGAAAGCUAA 


2163 


GAUAAAGACUGCUGUUAGA 


2164 


GAG GAG AG C UC U AA U AU U A 


2165 


GAGGAAGCCUGAUUAAAGA 


2166 




: - "v ;v ;\ "<y y . ; f -Sense - r V"\" 




GAAAGAGCAUGAUGACAUA 


2167 


GAGGAGAGCUCUAUUAUGU 


2168 


GAAACUGAAUGGAAGAAUA 


2169 


AAAGAAGGCUGUACCGUUA 


2170 



PARVA 



PIK3CG 



PRKDC 



RASA1 



SLC9A1 



^ ; *v ..Sense ■ v .v. -'••,-.«. \ •. 




CUACAUGUCUUUGCUCUUA 


2171 


GCUAAGUCCUGUAAGAAUA 


2172 


CAAAGGCAAUGUACUGUUU 


2173 


GAACAAUGGUGGAUCCAAA 


2174 






;...■■*■.- :. i: - 


AAGUUCAGCUUCUCUAUUA 


2175 


GAAGAAAUCUCUGAUGGAU 


2176 


GAACACCUUUACUCUAUAA 


2177 


GCAUGGAGCUGGAGAACUA 


2178 




■ • Sense 


. , . 

\p 


GAUGAAAGCUCUAAAGAUG 


2179 


GAAAGGAGGUUCUAAACUA 


2180 


GGAAGAAGCUCAUUUGAUU 


2181 


GCAAAGAGGUGGCAGUUAA 


2182 




Sense • 


_ < - . 

* 


GGAAGAAGAUCCACAUGAA 


2183 


GAACAUACUUUCAGAGCUU 


2184 


GAACAAUCUUUGCUGUAUA 


2185 


UAACAGAACUGCUUCAACA 


2186 




im ■ , -issme urn. >mmm 




GAAGAGAUCCACACACAGU 


2187 


UCAAUGAGCUGCUGCACAU 


2188 


GAAGAUAGGUUUCCAUGUG 


2189 


| GAAUUACCCUUCCUCAUCU 


2190 



TEGT 



TERT 



; -^- "..nKSense . 'i ■" ; - ; 


: - i 


CUACAGAGCUUCAGUGUGA 


2191 


* 

i 


GAACAUAUUUGAUCGAAAG 


2192 




GAGCAAACCUAGAUAAGGA 


2193 




GCAUUGAUCUCUUCUUAGA 


2194 






Sense 




GGAAGACAGUGGUGAACUU 


2195 


GCAAAGCAUUGGAAUCAGA 


2196 


GAGCUGACGUGGAAGAUGA 


2197 


GAACGGGCCUGGAACCAUA 


2198 



TNFRSF 
6 


Sense 






GAUACUAACUGCUCUCAGA 


2199 
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TOP1 



TOP2A 



TOP3A 



TOP3B 



WEE1 



GAAAGAAUGGUGUCAAUGA 


2200 


UCAAUAAUGUCCCAUGUAA 


2201 


U CAUG AAUCUCC AACC U UA 


2202 


GAUGUUGACUUGAGUAMU ! 2203 




. "J Sense . ; . ' 




GAAAGGAAAUGACUAAUGA 


2204 


GAAGAAGGCUGUUCAGAGA 


2205 


GGAAGUAGCUACGUUCUUU 


2206 


GGACAUAAGUGGAAAGMG 


2207 




[ - ' »--'"-;S.ense * - . a/^r 


! ■"• ■ *? » i« 1 


GAAAGAGUCCAUCAGAUUU 


2208 I 


CAAACUACAUUGGCAUUUA 


2209 1 


AAACAGACAUGGAUGGAUA 


2210 


CGAAAGGAAUGGUUAACUA 


2211 




Sensed * 


■m^ - m \ ■ 


CCAGAAAUCUUCCACAGAA 


2212 


GAAACUAUCUGGAUGUGUA 


2213 


1 CCACAAAGAUGGUAUCGUA 


2214 


| GGAAAUGGCUGUGGUAACA 


2215 




:*ffi& -Ssnse >-f::,v 


i.: <\ii£:: < -i 


GAGACAAGAUGAAGACUGU 


2216 


GCACAUGGGCUGCGUCUUU 


2217 


CCAGUGCGCUUCAAGAUGA 


2218 


GAACAUCUGCUUUGAGGUU | 2219 




Sense . . - " ■ • • • - 


GGUAUUGCCUUGUGAAUUU 


2220 


GCAGAACAAUUACGAAUAG 


2221 


GUACAUAGCUGUUUGAAAU 


2222 


GCUGUAAACUUGUAGCAUU 


2223 



In addition, to identifying functional siRNA against gene families or 
pathways, it is possible to design duplexes against genes known to be involved in 
5 specific diseases. For example when dealing with human disorders associated with 
allergies, it will be beneficial to develop siRNA against a number of genes including 
but not limited to: 
the interleukin 4 receptor gene 

(SEQ. ID NO. 2224: UAGAGGUGCUCAUUCAUUU, 
10 SEQ. ID NO. 2225: GGUAUAAGCCUUUCCAAGA, 
SEQ. ID NO. 2225: ACACACAGCUGGAAGAAAU, 
SEQ. ID NO. 2226: UAACAGAGCUUCCUUAGGU), 



the Beta-arrestin-2 
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(SEQ. ID NO. 2227: GGAUGAAGGAUGACGACUA, 
SEQ. ID NO. 2228: ACACCAACCUCAUUGAAUU, 
SEQ. ID NO. 2229: CGAACAAGAUGACCAGGUA, 
SEQ. ID NO. 2230: GAUGAAGGAUGACGACUAU, ), 

5 

the interferon-gamma receptor 1 gene 
(SEQ. ID NO. 2231: CAGCAUGGCUCUCCUCUUU, 
SEQ. ID NO. 2232: GUAAAGAACUAUGGUGUUA, 
SEQ. ED NO. 2233 : GAAACUACCUGUUACAUUA, 
1 0 SEQ. ED NO. 2234: GAAGUGAGAUCCAGUAUAA), 

the matrix metalloproteinase MMP-9 
(SEQ. ID NO. 2235: GGAACCAGCUGUAUUUGUU, 
SEQ. ID NO. 2236: GUUGGAGUGUUUCUAAUAA, 
1 5 SEQ. ED NO. 2237: GCGCUGGGCUUAGAUCAUU, 
SEQ. ID NO. 2238: GGAGCCAGUUUGCCGGAUA), 

the Slcllal (Nrampl) gene 

(SEQ. ID NO. 2239: CCAAUGGCCUGCUGAACAA, 
20 SEQ. ID NO. 2240: GGGCCUGGCUUGCUCAUGA, 
SEQ. ID NO. 2241 : GGGCAGAGCUCCACCAUGA, 
SEQ. ED NO. 2242: GCACGGCCAUUGCAUUCAA), 

SPEMK5 

25 (SEQ. ED NO. 2243: CCAACUGCCUGUUCAAUAA, 
SEQ. ED NO. 2244: GGAUACAUGUGAUGAGUUU, 
SEQ. ID NO. 2245: GGACGAAUGUGCUGAGUAU, 
SEQ. ID NO. 2246: GAGCUUGUCUUAUUUGCUA,), 

30 the CYP1A2 gene 

(SEQ. ID NO. 2247: GAAAUGCUGUGUCUUCGUA, 
SEQ. ED NO. 2248: GGACAGCACUUCCCUGAGA, 
SEQ. ID NO. 2249: GAAGACACCACCAUUCUGA, 
SEQ. ID NO. 2250: GGCCAGAGCUUGACCUUCA), 
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thymosin-beta4Y 

(SEQ. ID NO. 2251: GGACAGGCCUGCGUUGUUU, 
SEQ. ID NO. 2252: GGAAAGAGGAAGCUCAUGA, 
5 SEQ. ID NO. 2253: GCAAACACGUUGGAUGAGU, 
SEQ. ID NO. 2254: GGACUAUGCUGCCCUUUUG, 

activin A receptor IB 

(SEQ. ID NO. 2255: ACAAGACGCUCCAGGAUCU, 
1 0 SEQ. ID NO. 2254: GCAACAGGAUCGACUUGAG, 
SEQ. ID NO. 2255: GAAGCUGCGUCCCAACAUC, 
SEQ. ID NO. 2256: GCAUAGGCCUGUAAUCGUA, 
SEQ. ID NO. 2257: UCAGAGAGUUCGAGACAAA, 
SEQ. ID NO. 2258: UGCGAAAGGUUGUAUGUGA, 
1 5 SEQ. ID NO. 2259: GCAACAGGAUCGACUUGAG, 
SEQ. ID NO. 2260: GAAUAGCGUUGUGUGUUAU, 
SEQ. ID NO. 2261 : UGAAUAGCGUUGUGUGUUA, 
SEQ. ID NO. 2262: GGGAUCAGUUUGUUGAAUA, 
SEQ. ED NO. 2263: GAGCCUGAAUCAUCGUUUA, ), 

20 " r 

ADAM 3 3 

(SEQ. ID NO. 2264: GGAAGUACCUGGAACUGUA, 
SEQ. ID NO. 2265: GGACAGAGGGAACCAUUUA, 
SEQ. ID NO. 2266: GGUGAGAGGUAGCUCCUAA, 
25 SEQ. ID NO. 2267 : AAAGACAGGUGGCCACUGA), 

the TAP 1 gene 

(SEQ. ID NO. 2268: GAAAGAUGAUCAGCUAUUU, 
SEQ. ID NO. 2269: CAACAGAACCAGACAGGUA, 
30 SEQ. ID NO. 2270: UGAGAAAUGUUCAGAAUGU, 
SEQ. ID NO. 2271 : UACCUUCACUCGAAACUUA, 

COX-2 

(SEQ. ID NO. 2272: GAACGAAAGUAAAGAUGUU, 
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SEQ. ID NO. 2273: GGACUUAUGGGUAAUGUUA, 
SEQ. ID NO. 2274: UGAAAGGACUUAUGGGUAA, 
SEQ. ID NO. 2275: GAUCAGAGUUCACUUUCUU), 

5 ADPRT 

(SEQ. ID NO. 2276: GGAAAGAUGUUAAGCAUUU, 
SEQ. ID NO. 2277: CAUGGGAGCUCUUGAAAUA, 
SEQ. ID NO. 2278: GAACAAGGAUGAAGUGAAG, 
SEQ. ID NO. 2279: UGAAGAAGCUCACAGUAAA, ), 

10 

HDC 

(SEQ. ID NO. 2280: CAGCAGACCUUCAGUGUGA, 
SEQ. ID NO. 2281: GGAGAGAGAUGGUGGAUUA, 
SEQ. ID NO. 2282: GUACAGAGCUGGAGAUGAA, 
1 5 SEQ. ID NO. 2283 : GAACGUCCCUUCAGUCUGU), 

HnmT 

(SEQ. ID NO. 2284: CAAAUUCUCUCCAAAGUUC, 
SEQ. ID NO. 2285: GGAUAUAUCUGACUGCUUU, 
20 SEQ. ID NO. 2286: GA GC AGAGCUUG GG AAAGA, 
SEQ. ID NO. 2287: GAUAUGAGAUGUAGCAAAU), 

GATA-3 

(SEQ. ID NO. 2288: GAACUGCUUUCUUUCGUUU, 
25 SEQ. ID NO. 2289: GCAGUAUCAUGAAGCCUAA, 
SEQ. ID NO. 2290: GAAACUAGGUCUGAUAUUC, 
SEQ. ID NO. 2291: GUACAGCUCCGGACUCUUC), 

Gab2 

30 (SEQ. ID NO. 2292: GCACAACCAUUCUGAAGUU, 
SEQ. ID NO. 2293: GGACUUAGAUGCCCAGAUG, 
SEQ. ID NO. 2294: GAAGGUGGAUUCUAGGAAA, 
SEQ. ID NO. 2295: GGACUAGCCCUGCUGUUUA), and 
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STAT6 

(SEQ. ID NO. 2296: GAUAGAAACUCCUGCUAAU, 
SEQ. ID NO. 2297: GGACAUUUAUUCCCAGCUA, 
SEQ. ID NO. 2298: GGACAGAGCUACAGACCUA, 
5 SEQ. ID NO. 2299: GGAUGGCUCUCCACAGAUA). 

In addition, rationally designed siRNA or siRNA pools can be directed against 
genes involved in anemia, hemophila or hypercholesterolemia. Such genes would 
include, but are not be limited to: 
10 APOA5 

(SEQ. ID NO. 2300: GAAAGACAGCCUUGAGCAA, 
SEQ. ID NO. 2301 : GGACAGGGAGGCCACCAAA, 
SEQ. ID NO. 2302: GGACGAGGCUUGGGCTJUUG, 
SEQ. ID NO. 2303: AGCAAGACCUCAACAAUAU), 

15 

HMG-CoA reductase 

(SEQ. ID NO. 2304: GAAUGAAGCUUUGCCCUUU, 
SEQ. ID NO. 2305: GAACACAGUUUAGUGCUUU, 
SEQ. ID NO. 2306: UAUCAGAGCUCUUAAUGUU, 
20 SEQ. ID NO. 2307: UGAAGAAUGUCUACAGAUA), 

NOS3 

(SEQ... ID NO. 2308: UGAAGCACCUGGAGAAUGA, 
SEQ. ID NO. 2309: CGGAACAGCACAAGAGUUA, 
25 SEQ. ID NO. 23 10: GGAAGAAGACCUUUAAAGA, . 
SEQ. ID NO. 2309: GCACAAGAGUUAUAAGAUC), 



ARH 

(SEQ. ID NO. 2310: CGAUACAGCUUGGCACUUU, 
30 SEQ. ID NO. 23 1 1 : GAGAAGCGCUGCCCUGUGA, 
SEQ. ID NO. 2312: GAAUCAUGCUGUUCUCUUU, 
SEQ. ID NO. 2313: GGAGUAACCGGACACCUUA), 



CYP7A1 
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(SEQ. ID NO. 2314: UAAGGUGACUCGAGUGUUU, 
SEQ. ID NO. 2315: AAACGACACUUUCAUCAAA, 
SEQ. ID NO. 2316: GGACUCAAGUUAAAGUAUU, 
SEQ. ID NO. 2317: GUAAUGGACUCAAGUUAAA), 

5 

FANCA 

(SEQ. ID NO. 2318: GGACAUCACUGCCCACUUC, 
SEQ. ID NO. 2319: AGAGGAAGAUGUUCACUUA, 
SEQ. ID NO. 2320: GAUCGUGGCUCUUCAGGAA, 
10 SEQ. ID NO. 2321 : G G AC AGAGGC AGAU AAG AA) , 

FANCG 

(SEQ. ED NO. 2322: GCACUAAGCAGCCUUCAUG, 
SEQ. ID NO. 2323 : GCAAGCAGGUGCCUACAGA, 
15 SEQ. ID NO. 2324: GGAAUUAGAUGCUCCAUUG, 
SEQ. ID NO. 2325 : GGACAUCUCUGCCAAAGUC), 

ALAS 

(SEQ. ID NO. 2326: C A AUAUGCCUG GAAACU AU , 
20 SEQ. ID NO. 2327: GGUUAAGACUCACCAGUUC,:* 
SEQ. ID NO. 2328: CAACAGGACUUUAGGUUCA, 
SEQ. ID NO. 2329: GCAUAAGAUUGACAUCAUC), 

PIGA 

25 (SEQ. ID NO. 2330: GAAAGAGGGCAUAAGGUUA, 
SEQ. ID NO. 2331 : GGACUGAUCUUUAAACUAU, 
SEQ. ID NO. 2332: UCAAAUGGCUUACUUCAUC, 
SEQ. ID NO. 2333: UCUAAGAACUGAUGUCUAA), and 

30 factor VIII 

(SEQ. ID NO. 2334: GCAAAUAGAUCUCCAUUAC, 
SEQ. ID NO. 2335: CCAGAUAUGUCGUUCUUUA, 
SEQ. ID NO. 2336: GAAAG GCUGUGCUCUCAAA , 
SEQ. ID NO. 2337: GGAGAAACCUGCAUGAAAG, 
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SEQ. ID NO. 2338: CUUGAAGCCUCCUGAAUUA, 
SEQ. ID NO. 2339: GAGGAAGCAUCCAAAGAUU, 
SEQ. ID NO. 2340: G AUAG G AG AU AC AAACUUU) . 



5 Furthermore, rationally designed siKNA or siRNA pools can be directed 

against genes involved in disorders of the brain and nervous system. Such genes 
would include, but are not be limited to: 
APBB1 

(SEQ. ID NO. 2341: CUACGUAGCUCGUGAUAAG, 
10 SEQ. ID NO. 2342: GCAGAGAUGUCCACACGUU, 
SEQ. ID NO. 2343: CAUGAGAUCUGCUCUAAGA, 
SEQ. ID NO. 2344: GGGCACCUCUGCUGUAUUG), 

BACE1 

15 (SEQ. ID NO. 2345: CCACAGAGCAAGUGAUUUA, 
SEQ. ID NO. 2346: GCAGAAAGGAGAUCAUUUA, 
SEQ. ID NO. 2347: GUAGCAAGAUCUUUACAUA, 
SEQ. ID NO. 2348: UGUCAGAGCUUGAUUAGAA), 

20 PSEN1 

(SEQ. ID NO. 2349: GAGCUGACAUUGAAAUAUG, 
SEQ. ID NO. 2350: GUACAGCUAUUUCUCAUCA, 
SEQ. ID NO, 2351: GAGGUUAGGUGAAGUGGUU, 
SEQ. ID NO. 2352: GAAAGGGAGUCACAAGACA, 
25 SEQ. ID NO. 2353 : GAACUGGAGUGGAGUAGGA, 
SEQ. ID NO. 2354: CAGCAGGCAUAUCUCAUUA, 
SEQ. ID NO. 2355: UCAAGUACCUCCCUGAAUG), 



PSEN2 

30 (SEQ. ID NO. 2356: GCUGGGAAGUGGCUUAAUA, 
SEQ. ID NO. 2357: CAUAUUCCCUGCCCUGAUA, 
SEQ. ID NO. 2358: GGGAAGUGCUCAAGACCUA, 
SEQ. ID NO. 2359: CAUAGAAAGUGACGUGUUA), 
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MASS1 • 

(SEQ. ID NO. 2360: GGAAGGAGCUGUUAUGAGA, 

4 

SEQ. ID NO. 2361: GAAAGGAGAAGCUAAAUUA, 
SEQ. ID NO. 2362: GGAGGAAGGUCAAGAUUUA, 
5 SEQ. ID NO. 2363 : GGAAAUAGCUGAGAUAAUG, ), 

ARX 

(SEQ. ID NO. 2364: CCAGACGCCUGAUAUUGAA, 
SEQ. ID NO. 2365: CAGCACCACUCAAGACCAA, 
10 SEQ. ID NO. 23 66 : CGCCUGAUAUUGAAGUAAA, 

SEQ. ID NO. 2367: CAACAUCCACUCUCUCUUG) and 

NNMT 

(SEQ. ED NO. 2368: GGGCAGUGCUCCAGUGGUA, 
1 5 SEQ. ID NO. 2369: GAAAGAGGCUGGCUACACA, 
SEQ. ID NO. 2370: GUACAGAAGUGAGACAUAA, 
SEQ. ID NO. 2371: GAGGUGAUCUCGCAAAGUU). 



In addition, rationally designed siRNA or siRNA pools can be directed against 
20 genes involved in hypertension and related disorders. Such genes would include, but 
are not be limited to : 
angiotensin II type 1 receptor 

(SEQ. IDNO.,23-72: CAAGAAGCCUGCACCAUGU, " 
SEQ. ID NO. 2373: GCACUUCACUACCAAAUGA, 
25 SEQ. ID NO. 2374: GCACUGGUCCCAAGUAGUA, 
SEQ. ID NO. 2375: CCAAAGGGCAGUAAAGUUU, 
SEQ. ID NO. 2376: GCUCAGAGGAGGUGUAUUU, 
SEQ. ID NO. 2377: GCACUUCACUACCAAAUGA, 
SEQ. ID NO. 2378: AAAGGGCAGUAAAGUUU), 

30 

AGTR2 

(SEQ. ID NO. 2379: GAACAUCUCUGGCAACAAU, 
SEQ. ID NO. 2380: GGUGAUAUAUCUCAAAUUG, 
SEQ. ID NO. 2381: GCAAGCAUCUUAUAUAGUU, 
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SEQ. ED NO. 2382: GAACCAGUCUUUCAACUCA), and other related targets. 



Example XIII: Validation of Muitigene Knockout using Rab5 and Eps 

Two or more genes having similar, overlapping functions often leads to 

5 genetic redundancy. Mutations that knockout only one of, e.g., a pair of such genes 
(also referred to as homologs) results in little or no phenotype due to the fact that the 
remaining intact gene is capable of fulfilling the role of the disrupted counterpart. To 
fully understand the function of such genes in cellular physiology, it is often 
necessary to knockout or knockdown both homologs simultaneously. Unfortunately, 

1 0 concomitant knockdown of two or more genes is frequently difficult to achieve in 

higher organisms (e.g. mice) thus it is necessary to introduce new technologies dissect 
gene function. One such approach to knocking down multiple genes simultaneously 
is by using siRNA. For example, Figure 11 showed that rationally designed siRNA 
directed against a number of genes involved in the clathrin-mediated endocytosis 

1 5 pathway resulted in significant levels of protein reduction (e.g. >80%). To determine 
the effects of gene knockdown on clathrin-related endocytosis, internalization assays 
were performed using epidermal growth factor and transferrin. Specifically, mouse 
receptor-grade EGF (Collaborative Research Inc.) and iron-saturated human 
transferrin (Sigma) were iodinated as described previously (Jiang, X., Huang, F., 

20 Marusyk, A. & Sorkin, A. (2003) Mol Biol Cell 14, 858-70). HeLa- cells grown in 12- 

1 9 S 

well dishes were incubated with 125 I-EGF (1 ng/ml) or I-transferrin (1 |ig/ml) in 
binding medium (DMEM, 0.1% bovine serum albumin) at 37°C, and the ratio of 
internalized 1 dnd surface radioactivity was determined during 5-min time course to 
calculate specific internalization rate constant k e as described previously (Jiang, X et 
25 al). The measurements of the uptakes of radiolabeled transferrin and EGF were 
performed using short time-course assays to avoid influence of the recycling on the 
uptake kinetics, and using low ligand concentration to avoid saturation of the clathrin- 
dependent pathway (for EGF Lund, K. A., Opresko, L. K., Strarbuck, C., Walsh, B. J. 
& Wiley, H. S. (1990) J. Biol .Chem. 265, 15713-13723). 



30 



The effects of knocking down Rab5a, 5b, 5c, Eps, or Eps 15R (individually) 
are shown in Figure 22 and demonstrate that disruption of single genes has little or no 
effect on EGF or Tfh internalization. In contrast, simultaneous knock down of RabSa, 
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5b s and 5c, or Eps and Eps 1 5R, leads to a distinct phenotype (note: total 
concentration of siRNA in these experiments remained constant with that in 
experiments in which a single siRNA was introduced, see Figure 23). These 
experiments demonstrate the effectiveness of using rationally designed siRNA to 
5 knockdown multiple genes and validates the utility of these reagents to override 
genetic redundancy. 

Example XIV. Validation of Multigene Targeting Using G6PD, GAPDH, PLK, 
and UQC. 

Further demonstration of the ability to knock down expression of multiple 
genes using rationally designed siRNA was performed using pools of siRNA directed 
against four separate genes. To achieve this, siRNA were transfected into cells (total 
siRNA concentration of lOOnM) and assayed twenty-four hours later by B-DNA. 
Results shown in Figure 24 show that pools of rationally designed molecules are 
capable of simultaneously silencing four different genes. 



10 



15 



Example XV. Validation of Multigene Knockouts As Demonstrated by Gene 
Expression Profiling, a Prophetic Example 

To further demonstrate the ability to concomitantly knockdown the expression 
20 of multiple gene targets, single siRNA or siRNA pools directed against a collection of 
genes (e.g. 4, 8, 16, or 23 different targets) are simultaneously transfected into cells 
and cultured for twenty- four hours. Subsequently, mRNA is harvested from treated 
(and untreated) cells- andJabeled with one of two fluorescent probes dyes (e.g. a red 
fluorescent probe for the treated cells, a green fluorescent probe for the control cells.). 
25 Equivalent amounts of labeled RNA from each sample is then mixed together and 
hybridized to sequences that have been linked to a solid support (e.g. a slide, "DNA 
CHIP"). Following hybridization, the slides are washed and analyzed to assess 
changes in the levels of target genes induced by siRNA. 



30 Example XVI. Identifying Hyperfunctional siRNA 

Identification of Hyperfunctional Bcl-2 siRNA 

The ten rationally designed Bcl2 siRNA (identified in Figure 13, 14) were 
tested to identify hyperpotent reagents. To accomplish this, each of the ten Bcl-2 
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siRNA were individually transfected into cells at a 300pM (0.3nM) concentrations. 
Twenty-four hours later, transcript levels were assessed by B-DNA assays and 
compared with relevant controls. As shown in Figure 25, while the majority of Bcl-2 
siRNA failed to induce functional levels of silencing at this concentration, siRNA 1 
5 and 8 induced >80% silencing, and siRNA 6 exhibited greater than 90% silencing at 
this subnanomolar concentration. 



By way of prophetic examples, similar assays could be performed with any of 
the groups of rationally designed genes described in Example VII or Example VIII. 
10 Thus for instance, rationally designed siRNA sequences directed against 
PDGFA 

(SEQ. ID NO. 2383: GGUAAGAUAUUGUGCUUUA, 
SEQ. ID NO. 2384: CCGCAAAUAUGOAGAAUUA, 
SEQ. ID NO. 2385: GGAUGUACAUGGCGUGUUA, 
15 SEQ. ID NO. 2386: GGUGAAGUUUGUAUGUUUA), or 



PDGFB 

(SEQ. ID NO. 2387: GCUCCGCGCUUUCCGAUUU, 
SEQ. ID NO. 2388: GAGCAGGAAUGGUGAGAUG, 
20 SEQ. ID NO. 2389: GAACUUGGGAUAAGAGUGU, _: 
SEQ. ID NO. 2390: CCGAGGAGCUUUAUGAGAU, 
SEQ. ID NO. 2391: UUUAUGAGAUGCUGAGUGA) 

could be introduced 'intoieells at increasingly limiting concentrations to determine 
whether any of the duplexes are hyperfunctional. Similarly, rationally designed 
25 sequences directed against 
HIF1 alpha 

(SEQ. ID NO. 2392: GAAGGAACCUGAUGCUUUA, 
SEQ. ID NO. 2393: GCAUAUAUCUAGAAGGUAU, 
SEQ. ED NO. 2394: GAACAAAUACAUGGGAUUA, 
30 SEQ. ID NO. 2395: GGACACAGAUUUAGACUUG), or 

VEGF 

(SEQ. ID NO. 2396: GAACGUACUUGCAGAUGUG, 
SEQ. ID NO. 2397: GAGAAAGCAUUUGUUUGUA, 
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SEQ. ID NO. 2398: GGAGAAAGCAUUUGUUUGU, 

SEQ. ID NO. 2399: CGAGGCAGCUUGAGUUAAA) could be introduced into cells 
at increasingly limiting concentrations and screened for hyperfunctional duplexes. 



5 Example XVII: Gene Silencing: Prophetic Example 

Below is an example of how one might transfect a cell. 

a. Select a cell line. The selection of a cell line is usually determined by the 
desired application. The most important feature to RNAi is the level of 
expression of the gene of interest. It is highly recommended to use cell lines 
10 for which siRNA transfection conditions have been specified and validated. 



b. Plate the' Cells. Approximately 24 hours prior to transfection, plate the cells at 
the appropriate density so that they will be approximately 70 - 90% confluent, 
or approximately 1 x 1 0 5 cells/ml at the time of transfection. Cell densities 

1 5 that are too low may lead to toxicity due to excess exposure and uptake of 

transfection reagent-siRNA complexes. Cell densities that are too high may 
lead to low transfection efficiencies and little or no silencing. Incubate the 
cells overnight. Standard incubation conditions for mammalian cells are 37°C 
in 5% C0 2 . Other cell types, such as insect cells, require different 

20 temperatures and CO2 concentrations that are readily ascertainable by,persons 

skilled in the art. Use conditions appropriate for the cell type of interest. 



SiRNA re-suspension-aiAdd 20 |xl siRNA universal buffer to each siRNA to 
generate a final concentration of 50 |iM. 

SiRNA-lipid complex formation. Use RNase-free solutions and tubes. Using 
the following table, Table XI: 
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e. 



Table XI 


* * K *t » , T 

■- ■ - * 


96-well 


24-Svell : 


Mixture 1 (TransIT-TKO-Plasmid dilution mixture) 


Opti-MEM 


9.3 |il 


46.5 ul 


TranslT-TKO (1 ug/HD 


0.5 |il 


2.5 ul 








v Mixture 1 -Final Volume- 


l&Oul ^ 


1 50.6 jil 



Mixture 2 (siRNA dilution mixture) 


Opti-MEM 


9.0 \i\ 


45.0 ul 


siRNA (1 uM) 


1.0 Hi 


5.0 ul 


Mixture 2 Final Volume 


^10.0 |il 


50.0 ui# 




Mixture 3 (siRNA-Transfection reagent mixture) 


Mixture 1 


10 |il 


50 Hi 


Mixture 2 


10 ul 


50 Hi 


l#Mixture 3 Final Volume^ 


20^|il 


liiiooui; *3 




Incubate 20 minutes at room temperature. 


Mixture 4 (Media-siRNA/Transfection reagent mixture) 


Mixture 3 


20 Hi 


100 Hi 


Complete media 


80 Hi 


400 Hi 


V Mix&e-^F^ 




500 HI 




Incubate 48 hours at 37°C. 



5 Transfection . Create a Mixture 1 by combining the specified amounts of OPTI-MEM 
serum free media and transfection reagent in a sterile polystyrene tube. Create a 
Mixture 2 by combining specified amounts of each siRNA with OPTI-MEM media in 
sterile 1 ml tubes. Create a Mixture 3 by combining specified amounts of Mixture 1 
and Mixture 2. Mix gently (do not vortex) and incubate at room temperature for 20 

10 minutes. Create a Mixture 4 by combining specified amounts of Mixture 3 to 

complete media. Add appropriate volume to each cell culture well. Incubate cells 
with transfection reagent mixture for 24 - 72 hours at 37°C. This incubation time is 
flexible. The ratio of silencing will remain consistent at any point in the time period. 
Assay for gene silencing using an appropriate detection method such as RT-PCR, 

1 5 Western blot analysis, immunohistochernistry s phenotypic analysis, mass 
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spectrometry, fluorescence, radioactive decay, or any other method that is now known 
or that comes to be known to persons skilled in the art and that from reading this 
disclosure would useful with the present invention. Tb e optimal window for 
observing a knockdown phenotype is related to the mRNA turnover of the gene of 
5 interest, although 24 - 72 hours is standard. Final Volume reflects amount needed in 
each well for the desired cell culture format. When adjusting volumes for a Stock' 
Mix, an additional 10% should be used to accommodate variability in pipetting, etc. 
Duplicate or triplicate assays should be carried out when possible. 

10 While the invention has been described in connection with specific 

embodiments thereof, it will be understood that it is capable of further modifications 
and this application is intended to cover any variations, uses, or adaptations of the 
invention following, in general, the principles of the invention and including such 
departure from the present disclosure as come within known or customary practice 

15 within the art to which the invention pertains and as may be applied to the essential 
features hereinbefore set forth and as follows in the scope of the appended claims. 



. -20 
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Claims 

1 . A method for selecting siRNA comprising selecting an siRNA molecule of 
19 — 25 nucleoside bases, said method comprising: 

(a) selecting a target gene; 

(b) measuring the functionality of sequences of 19 - 25 nucleotides in 
length that are substantially complementary to a stretch of nucleotides 
of the target sequence, wherein said functionality is dependent upon 
non-target specific criteria. 



10 2. The method according to claim 1 wherein said functionality is determined 

by applying one of the following formulas: 



15 



30 



Formula I = -(GC/3) +(AUi 5 -i 9 ) -(Tm 20 °c)*3 -(G ]3 )*3 -(C 19 ) +(Ai 9 )*2 +(A 3 ) 
+(U 10 )+(Ai4)-(U5)-(A n ); 



Formula II = -(GC/3) -(AU 15 -i 9 )*3 -(G 13 )*3 -(C 19 ) +(A 19 )*2 +(A 3 ); 



Formula III = -(GC/3) +(AUi 5 -i 9 ) -(Tm 20 <>c)*3; 



20 Formula IV - -(GC/2)+(AU 15 -i9)/2-(Tm 2 ox) st: 2 -(Gi 3 )*3 -(Ci 9 ) +(A !9 )*2 

+(A 3 ) +(U 1 o)+(Ai 4 )-(U 5 )-(A 11 ); 



Formula V « -(Gi 3 )*3 -(C 19 ),±(Ai#2 +(A 3 ) + (U 10 )+(Ai 4 ) -(Us) -(An); 



25 Formula VI = -(Gi 3 )*3 -(Ci 9 ) +(A 19 )*2 +(A 3 ); 



Formula VII = -(GC/2) +(AU 15 -i9)/2 -( Tm 2 o°c)*l ~(G !3 )*3 -(C ]9 ) +(A 19 )*3 
+(A 3 )*3 +(U 10 )/2+(A 14 )/2 -(U 5 )/2 -(An)/2; 



wherein in Formulas I - VII: 



AUi5_i 9 = 0-5 depending on the number of A or U bases on the sense 
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strand at positions 15-19; 
Gn = 1 if G is the base at position 13 on the sense strand, otherwise its 

value is 0; 

Ci 9 = 1 if C is the base at position 1 9 of the sense strand, otherwise its 

5 value is 0; 

GC = the number of G and C bases in the entire sense strand; 
Tm 20 °c = 1 if the Tm is greater than 20°C; 

A 3 = 1 if A is the base at position 3 on the sense strand, otherwise its 
value is 0; 

10 Ai i = 1 if A is the base at position 1 1 on the sense strand, otherwise its 

value is 0; 

A w = 1 if A is the base at position 14 on the sense strand, otherwise its 
value is 0; 

A19 = 1 if A is the base at position 19 on the sense strand, otherwise its 

1 5 value is 0; 

U 5 = 1 if U is the base at position 5 on the sense strand, otherwise its 

value is 0; 

Uio= 1 if U is the base at position 10 on the sense strand, otherwise its 
value is 0; 

20 .... or, 



Formula VIII: (-14)*Gi 3 -13*Ai-12*«^^ 

9*Aio-9*U9-9*Ci8-8*G 1 o-7*U r 7 :i: Ui6-7*Ci7-7*C 1 9 

25 +7*Uit+8*A2+8*A4+8*A^ 

ll*C 9 +15*Gi+ 18*A 3 +19*Uio-Tm-3* (GC to tai)-6*(GCi 5 -i9)- 

30*X; and 



* 



Formula IX: (14.1)*A 3 +(14.9)*A6+(17.6)*Ai3+(24.7)*A 19 +(14.2)*Uio+(10.5) 

30 . C 9 +(23.9)*Gi+(163)*G 2 ^ 

(-1 l)*U 3 +(-l 5.2)nJi 5 +(-l 1 .3) *Ui6 + (-1 1 .8)*C 3 +(-17.4)*C6+(- 
10.5)*C 7 + (.13.7)^G 13 +(-25.9)*G!r-Tm-3* (GC total ) - 6*(GC 15 -i 9 > 

30*X 

wherein 
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Ai = 1 if A is the base at position 1 of the sense strand, otherwise its value is 0; 

A2 = 1 if A is the base at position 2 of the sense strand, otherwise its value is 0; 

A3 = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0; 

A4 = 1 if A is the base at position 4 of the sense strand, otherwise its value is 0; 
5 A5 = 1 if A is the base at position 5 of the sense strand, otherwise its value is 0; 

Ae = 1 if A is the base at position 6 of the sense strand, otherwise its value is 0; 

A7 = 1 if A is the base at position 7 of the sense strand, otherwise its value is 0; 

A10 = 1 if A is the base at position 10 of the sense strand, otherwise its value is 0; 

An = 1 if A is the base at position 11 of the sense strand, otherwise its value is 0; 
10 A13 = 1 if A is the base at position 13 of the sense strand, otherwise its value is 0; 

A19 = 1 if A is the base at position 19 of the sense strand, otherwise if another base 
is present or the sense strand is only 1 8 base pairs in length, its value is 0; 

C3 = 1 if C is the base at position 3 of the sense strand, otherwise its value is 0; 
15 C4 = 1 if C is the base at position 4 of the sense strand, otherwise its value is 0; 

C5 = 1 if C is the base at position 5 of the sense strand, otherwise its value is 0; 
= 1 if C is the base at position 6 of the sense strand, otherwise its value is 0; 

C7 = 1 if C is the base at position 7 of the sense strand, otherwise its value is 0; 

C9 = 1 if C is the base at position 9 of the sense strand, otherwise its value is 0; 
20 C17 = 1 if C is the base at position 17 of the sense strand, otherwise its value is 0; 

Cis = 1 if C is the base at position 1 8 of the sense strand, otherwise its value is 0; 

C19 = 1 if C is the base at position 19 of the sense strand, otherwise if another base 
is present or the sense strand is only^L&hase pairs in length, its value is 0; 

25 Gi = 1 if G is the base at position 1 on the sense strand, otherwise its value is 0; 

G2 = 1 if G is the base at position 2 of the sense strand, otherwise its value is 0; 

Gg = 1 if G is the base at position 8 on the sense strand, otherwise its value is 0; 

G10 = 1 if G is the base at position 10 on the sense strand, otherwise its value is 0; 

Gn = 1 if G is the base at position 1 3 on the sense strand, otherwise its value is 0; 
30 G19 = 1 if G is the base at position 19 of the sense strand, otherwise if another base 

is present or the sense strand is only 18 base pairs in length, its value is 0; 

Ui = 1 if U is the base at position 1 on the sense strand, otherwise its value is 0; 
U2 = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0; 
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U3 = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0; 
U4 = 1 if U is the base at position 4 on the sense strand, otherwise its value is 0; 
U7 = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0; 
U 9 = 1 if U is the base at position 9 on the sense strand, otherwise its value is 0; 
U10 = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0; 
U15 = 1 if U is the base at position 15 on the sense strand, otherwise its value is 0; 
Ui6 = 1 if U is the base at position 1 6 on the sense strand, otherwise its value is 0; . 
U17 = 1 if U is the base at position 17 on the sense strand, otherwise its value is 0; 
Ui8 = 1 if U is the base at position 18 on the sense strand, otherwise its value is 0; 

GCi5_i9 = the number of G and C bases within positions 15 - 19 of the sense strand 
or within positions 1 5 -1 8 if the sense strand is only 1 8 base pairs in length; 

GQotai = the number of G and C bases in the sense strand; 

Tm= 100 if the targeting site contains an inverted repeat longer then 4 base pairs, 
otherwise its value is 0; and 

X = the number of times that the same nucleotide repeats four or more times in a 

row. 

3 . A method of gene-silencing comprising selecting an siRNA according to 
claim 2 and introducing it into a cell. 

4. The method according to claim 3 wherein said introducing is by allowing 
passive uptake of the siRNA. ^^^less^- , 

5. The method according to claim 3, wherein said introducing is through the 
use of a vector. 

6. A method for developing an siRNA algorithm for selecting siRNA, said 
method comprising: 

(a) selecting a set of siRNA; 

(b) measuring the gene silencing ability of each siRNA from said set; 

(c) determining the relative functionality of each siRNA; 

(d) determining the amount of improved functionality by the presence or 
absence of at least one variable selected from the group consisting of 
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the total GC content, melting temperature of the siRNA, GC content at 
positions 15-19, the presence or absence of a particular nucleotide at 
a particular position and the number of times that the same nucleotide 
repeats within a given sequence; and 
5 (e) developing an algorithm using the information of step (d). 



7. A method of selecting an siRNA with improved functionality, said method 
comprising using the algorithm of claim 6. 

10 8. A method of selecting hyperfunctional siRNA, said method comprising 

using at least one functional siRNA, wherein at least one said functional 
siRNA has been selected according to the method of claim 7 and 
measuring the silencing ability of said at least one functional siRNA, 
wherein silencing ability is measured at a concentration of less than 1 

1 5 nanomolar siRNA. 



9. An siRNA molecule, wherein said siRNA molecule is effective at 
silencing Bcl-2. 

20 10. , The siRNA molecule of claim 9, wherein said siRNA molecule comprises 

a sequence substantially similar to a sequence selected from the group 

consisting of GGGAGAUAGUGAUGAAGUA (SEQ. ID NO. 301); 

GAAGUACAUCCAUUAUAAG (SEQ, I©iM0.,3O2); 

GUACGACAACCGGGAGAUA (SEQ. ID NO. 303); 
25 AGAUAGUGAUGAAGUACAU (SEQ. ID NO. 304); 

UGAAGACUCUGCUCAGUUU (SEQ. ID NO. 305); 

CAUGCGGCCUCUGUUUGA (SEQ. ID NO. 306); 

UGCGGCCUCUGUUUGAUUU (SEQ. ID NO. 307); 

GAGAUAGUGAUGAAGUACA (SEQ. ID NO. 308); 
30 GGAGAUAGUGAUGAAGUAG (SEQ. ID NO. 309); and 

GAAGACUCUGCUCAGUUUG (SEQ. ID NO. 310). 



11. 



The siRNA molecule of claim 10, wherein said siRNA molecule 
comprises a sequence selected from the group consisting of 
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GGGAGAUAGUGAUGAAGUA (SEQ. ID NO. 301); 
GAAGUACAUCCAUUAUAAG (SEQ. ID NO. 302); 
GUACGACAACCGGGAGAUA (SEQ. ID NO. 303); 
AGAUAGUGAUGAAGUACAU (SEQ. ID NO. 304); 
UGAAGACUCUGCUCAGUUU (SEQ. ID NO. 305); 
GCAUGCGGCCUCUGUUUGA (SEQ. ID NO. 306); 
UGCGGCCUCUGUUUGAUUU (SEQ. ID NO. 307); 
GAGAUAGUGAUGAAGUACA (SEQ. ID NO. 308); 
GGAGAUAGUGAUGAAGUAC (SEQ. ID NO. 309); and 
GAAGACUCUGCUCAGUUUG (SEQ. ID NO. 3 1 0). 

1 2. The siRNA molecule of claim 1 1 j°wherein said siRNA molecule 
comprises GCAUGCGGCCUCUGUUUGA . 

1 3 . The siRNA molecule of claim 9, wherein said siRNA molecule comprises 
a sense strand and an anti-sense strand. ' 



14. The siRNA molecule of claim 9, wherien said siRNA molecule comprises 
a hairpin. 

15. The siRNA molecule of claim 9, wherein said siRNA molecule comprises 
between 18 and 30 base pairs. 

16. A kit for gene silencing comprising at least one siRNA selected from the 
group consisting of sequences substantially similar to the group consisting 
of GGGAGAUAGUGAUGAAGUA (SEQ. ID NO. 301); 
GAAGUACAUCCAUUAUAAG (SEQ. ID NO. 302); 
GUACGACAACCGGGAGAUA (SEQ. ID NO. 303); 
AGAUAGUGAUGAAGUACAU (SEQ. ID NO. 304); 
UGAAGACUCUGCUCAGUUU (SEQ. ID NO. 305); 
GCAUGCGGCCUCUGUUUGA (SEQ. ID NO. 306); 
UGCGGCCUCUGUUUGAUUU (SEQ. ID NO. 307); 
GAGAUAGUGAUGAAGUACA (SEQ. ID NO. 308); 
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GGAGAUAGUGAUGAAGUAC (SEQ. ID NO. 309); and 
GAAGACUCUGCUCAGUUUG (SEQ. ID NO. 310). 

17. A method of gene silencing comprising using the siRNA molecule of 
5 claim 10. 



18. A method of gene silencing comprising using the siRNA molecule of claim 
11. 

10 1 9. A kit, wherein said kit is comprised of at least two siRNA, wherein said at 

least two siRNA comprise a first optimized siRNA and a second optimized 
siRNA, wherein said first optimized siRNA and said second optimized 
siRNA are optimized according to one of the following formulas: 

1 5 Formula I - -(GC/3) +(AUis-i!>) -(Tm 20 °c)*3 -(Gi 3 )*3 -(Ci 9 ) +(A, 9 )*2 +(A 3 ) 

+(U 10 )+(A 14 )-(U5)-(Au); 

Formula II = -(GC/3) -(AUi 5 -i9)*3 -(Gi 3 )*3 -(Ci 9 ) +(A 19 )*2 +(A 3 ); 
20 Formula III = -(GC/3) +(AUi5-i 9 ) -(Tm 2 o°c)*3; 

Formula IV = -(GC/2)+(AUi 5 .i 9 )/2-(Tm 2 o-c)*2 -(G ]3 )*3 -(Q 9 ) +(A 19 )*2 
+(A 3 ) +(Uio)+(Ai 4 ) -(U 5 ) -(An); - 

Formula V = -(Gi 3 )*3 -(Ci 9 ) +(A i9 )*2 +(A 3 ) + (U 10 )+(A 14 ) -(U 5 ) -(A u ); 
Formula VI = -(Gi 3 )*3 -(Ci 9 ) +(Ai 9 )*2 +(A 3 ); 

Formula VII = -(GC/2) +(AU, 5 -i 9 )/2 -( Tm 20 «c)*l ~(G 13 )*3 -(Ci 9 ) +(A 19 )*3 
+(A 3 )*3 +(U,o)/2+(A, 4 )/2 -(U 5 )/2 -(A, 0/2; - 



25 



30 



wherein in Formulas I - VII: 
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AU15.19 = 0-5 depending on the number of A or U bases on the sense 

strand at positions 15-19; 
G13 = 1 if G is the base at position 13 on the sense strand, otherwise its 
5 value is 0; 

C19 = 1 if C is the base at position 19 of the sense strand, otherwise its 

value is 0; 

GC = the number of G and C bases in the entire sense strand; 
Tm 20°c = 1 if the Tm is greater than 20°C; 
10 A3 = 1 if A is the base at position 3 on the sense strand, otherwise its 

value is 0; 

An = 1 if A is the base at position 1 1 oh" the sense strand, otherwise its 
value is 0; 

Ah =1 if A is the base at position 14 on the sense strand, otherwise its 
15 value is 0; 

A19 = 1 if A is the base at position 19 on the sense strand, otherwise-its 
value is 0; 

U5 = 1 if U is the base at position 5 on the sense strand, otherwise its 
value is 0; 

20 Uio= lif U is the base at position 10 on the sense strand, otherwise its 

value is 0; 

or, 

25 Formula VIII: (-14)*Gi3-13*Ai-12*U 7 -ll*U2-10*Aii-10*U4-10*C3-10*C5-10*C6- 

9*Aio-9*U 9 -9*C, g -8*G,o-7*U,-7*Ui6-7*C,7-7*Ci9 
+7*Ui7+8*A 2 +8*A4+8*A5+8*C4+9 !i! G8+10*A 7 +10*U 1 8+ll*A,9+ 
11*C 9 +15*G,+ 18*A 3 +19*Uio-Tm-3* (GC to tai) -6*(GCi 5 _ 19 )- 
30*X; and 

30 

Formula IX: (14.1)*A 3 +(14.9)*A 6 +(17.6)*Ai3+(24.7)*Ai9+(14.2)*Uio+(10.5)* 

C 9 +(23.9)*Gi+(16.3)*G2+(-12.3)*Aii+(-19.3)*U 1 +(-12.1)*U 2 + 
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(-ll)W(-15.2)nJis+(^ 

10.5)*Ct+ (-13 J)*G 13 +(-25.9)*G 19 -Tm-3* (GC tot ai) - 6*(GCi 5 -i 9 )- 
30*X 

wherein 

5 Ai = 1 if A is the base at position 1 of the sense strand, otherwise its value is 0; 

A2 = 1 if A is the base at position 2 of the sense strand, otherwise its value is 0; 

A3 = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0; 

A4 = 1 if A is the base at position 4 of the sense strand, otherwise its value is 0; 

A5 = 1 if A is the base at position 5 of the sense strand, otherwise its value is 0; 
10 Ag = 1 if A is the base at position 6 of the sense strand, otherwise its value is 0; 

A7= 1 if A is the base at position 7 of the sense strand, otherwise its value is 0; 

A10 = 1 if A is the base at position 10 of the sense strand^ otherwise its value is 0; 

An - 1 if A is the base at position 1 1 of the sense strand, otherwise its value is 0; 

An = 1 if A is the base at position 13 of the sense strand, otherwise its value is 0; 
1 5 A19 = 1 if A is the base at position 1 9 of the sense strand, otherwise if another base 

is present or the sense strand is only 18 base pairs in length, its value is 0; 

C3 = 1 if C is the base at position 3 of the sense strand, otherwise its value is 0; 

C4 = 1 if C is the base at position 4 of the sense strand, otherwise its value is 0; 
20 C5 = 1 if C is the base atposition 5 of the sense strand, otherwise its value is 0; 

Cs = 1 if C is the base at position 6 of the sense strand, otherwise its value is 0; 

C7 = 1 if C is the base at position 7 of the sense strand, otherwise its value is 0; 

C9 — 1 if C is the base at position 9 of the sense strand, otherwise -its .value is 0; 

C17 = 1 if C is the base at position 17 of the sense strand, otherwise its value is 0; 
25 Cis = 1 if C is the base at position 1 8 of the sense strand, otherwise its value is 0; 

C19 = 1 if C is the base at position 19 of the sense strand, otherwise if another base 
is present or the sense strand is only 18 base pairs in length, its value is 0; 

Gi = 1 if G is the base at position 1 on the sense strand, ptherwise its value is 0; 
30 G2 = 1 if G is the base at position 2 of the sense strand, otherwise its value is 0; 
Gg = 1 if G is the base at position 8 on the sense strand, otherwise its value is 0; 
G\o= 1 if G is the base at position 10 on the sense strand, otherwise its value is 0; 
G13 = 1 if G is the base at position 1 3 on the sense strand, otherwise its value is 0; 
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G19 = 1 if G is the base at position 19 of the sense strand, otherwise if another base 
is present or the sense strand is only 1 8 base pairs in length, its value is 0; 

U] = 1 if U is the base at position 1 on the sense strand, otherwise its value is 0; 
5 U 2 = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0; 
U 3 = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0; 
U4 = 1 if U is the base at position 4 on the sense strand, otherwise its value is 0; 
U 7 = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0; 
U9 = 1 if U is the base at position 9 on the sense strand, otherwise its value is 0; 
10 U10 = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0; 
U15 = 1 if U is the base at position 15 on the sense strand, otherwise its value is 0; 
Uie = 1 if U is the base at position 16 on the sense strand,' otherwise its value is 0; 
Ui7 = 1 if U is the base at position 17 on the sense strand, otherwise its value is 0; 
Uig = 1 if.U is the base at position 1 8 on the sense strand, otherwise its value is 0; 

15 

GCi5_i9 = the number of G and C bases within positions 15 - 19 of the sense strand 

or within positions 1 5 -1 8 if the sense strand is only 1 8 base pairs in length; 
GCtotai— the number of G and C bases in the sense strand; 
Tm ~ 100 if the targeting site contains an inverted repeat longer then 4 base pairs, 
20 otherwise its value is Q^iand 

X = the number of times that the same nucleotide repeats four or more times in a row. 
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